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Abstract 


Consider  real  unreduced  n  x  n  symmetric  tridiagonal  matrices  with  all  sub¬ 
diagonal  entries  equal  to  one.  Such  a  matrix  has  distinct  real  eigenvalues  and  Parlett 
conjectured  that  they  must  differ  by  at  least 

2  (u>  -  i-J’/u,"  = 

where  uj  is  the  spread  of  diagonal  entries  and  should  exceed  4. 

We  show  that  the  conjecture  is  true  for  u  >  3 n,  but  fails  if  u/n  is  too  small. 
The  proof  rests  on  two  types  of  lower  bound  for  eigenvalue  separations  and  on  detailed 
estimates  of  the  ratios  of  entries  of  eigenvectors;  one  set  for  componentwise  ratio  of 
two  different  vectors,  another  for  adjacent  entries  in  the  same  vector.  These  results 
have  some  independent  interest. 
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Chapter  1 

Preliminaries 


1.1  Introduction  and  Summary 


For  over  one  hundred  years,  the  eigenvalue  problem  has  been  investigated  by  math¬ 
ematicians,  physicists,  and  engineers.  Scientists  explored  the  characterization,  location, 
perturbation  and  computation  of  eigenvalues,  to  name  a  few  topics.  This  thesis  is  devoted 
to  the  separation  of  eigenvalues.  We  will  find  the  minimum  gap  between  eigenvalues  over 
an  interesting  class  of  tridiagonal  matrices. 

We  consider  unreduced  nxn  symmetric  tridiagonal  matrices  with  all  subdiagonal 
entries  1.  The  typical  matrix  may  be  written 

/  1  1  ...  1  \ 


Jn(a)  =  tridiag 


a(l) 


a(2) . a(n)  , 


(1.1) 


\  1  1  ...  1  / 


where  a  =  (o(l), o(2),. ..,o(n))  denotes  the  diagonal  and  a(k)  G  R. 

J( a)  has  distinct  real  eigenvalues  and,  in  [10],  Parlett  conjectured  that  they  must 
differ  by  at  least 

2(w-  -)2/w"  =  0(w2-"), 

where  ur  =  max  a(k )  -  min  a(k )  is  the  spread  of  diagonal  entries  and  should  exceed  4. 

l<Jt<n  l<i<n 

Our  contribution  is  to  settle  this  conjecture.  It  is  true  for  >  3n  and  probably  for  some 
smaller  ratio  w/n.  Nevertheless  some  condition  on  u ;/n  is  necessary  because  we  have  counter 
examples  for  u>  >  4. 

As  will  be  shown  later,  the  problem  of  finding  the  minimal  separation  of  eigenvalues 
for  given  n  and  diagonal  spread  w  is  a  piecewise  smooth  constrained  optimization  problem. 
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The  first  task  is  to  find  matrices  that  satisfy  the  Kuhn-Tucker  conditions,  but  it  is  much 
harder  to  deduce  when  these  matrices  are  global  extrema.  Indeed,  when  u  is  not  large 
enough  the  rival  configurations  also  satisfy  the  Kuhn-Tucker  conditions  and  appropriate 
the  global  minimum  to  themselves. 

It  is  not  immediately  apparent  but  the  eigenvalue  gap  does  not  start  to  get  small 
until  a  few  eigenvalues  are  clearly  separated  from  the  others.  This  observation  directs  our 
attention  to  the  case  w  >  4. 

The  proof  rests  on  detailed  estimates  of  the  ratios  of  entries  of  eigenvectors;  one 
set  for  componentwise  ratio  of  two  different  vectors,  another  for  adjacent  entries  in  the  same 
vector.  These  results  have  some  independent  interest. 

The  motivation  leads  us  to  the  definition  of  minimum  eigenvalue  separation,  or 

gap- 

Definition  1  Let  {A,}  be  the  set  of  eigenvalues  of  a  matrix  A,  then  the  gap  of  A’s  spectrum 
is 

g(A)  :=  min  |Aj  -  A,j. 

«*> 

In  section  1.2  we  survey  some  of  the  voluminous  literature  on  gap.  We  also  include 
results  on  the  first  excitation  energy  of  the  Schrodinger  operator.  We  show,  in  section  1.3. 
that  the  minimal  gap  is  invariant  under  shift,  duality,  reversal  of  the  diagonal,  and  their 
combinations.  In  section  1.4  we  normalize  and  formulate  our  minimization  problem,  and 

give  a  complete  description  of  the  answer.  When  u  is  large  enough,  wj  =  . 0.^) 

becomes  the  minimizer.  We  compute  the  gradient  of  each  eigenvalue  separation  with  respect 
to  the  diagonal  in  section  1.5.  Section  1.6  includes  formulas  for  the  entries  of  normalized 
eigenvectors  and  their  symmetric  properties. 

Chapter  2  is  devoted  to  our  extremal  matrix  J(wj).  Its  eigenvalue  equation  and 
eigenvectors  can  be  expressed  in  terms  of  Chebyshev  polynomials.  By  symbolic  computa¬ 
tion,  we  can  derive  the  asymptotic  expansions  of  two  dominant  eigenvalues  and  hence  their 
separation.  Precise  bounds  are  included  as  well.  We  also  have  the  analogue  for  the  dual 
matrix  7(wJ). 

Chapter  3  explores  the  ratios  of  the  entries  of  eigenvectors.  Let  A„  and  An_j  be 
two  largest  eigenvalues  with  corresponding  eigenvectors  vn  and  vn_j.  We  prove  that  the 
ratio  is  strictly  monotonic  as  k  increases.  It  follows  that  V(A„  -  An_i)  obeys  certain 
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sign  pattern.  Ashbaugh  and  Benguria’s  Comparison  Theorem  [l]  is  an  easy  consequence. 
We  also  have  inequalities  for  the  ratios  and  and  establish  convexity  for 

the  components  of  an  eigenvector. 

Chapter  4  solves  our  minimization  problem  when  the  size  of  diagonal  spread  is 
large  enough.  Suppose  the  eigenvalues  are  in  increasing  order  Aj  <  A2  <  •  •  •  <  A„.  We 
obtain  a  lower  bound  for  each  At+j  —  A,,  which  indicates  the  middle  ones  always  have  bigger 
lower  bounds  than  A2  -  Aj  and  An  -  A„_i.  Hence  the  middle  separations  can  not  compete 
with  the  end  ones.  By  duality  we  only  need  to  consider  An  —  An_j .  From  the  refinement 
of  Sun’s  theorem,  the  trace  of  the  minimal  matrix  must  be  small.  In  view  of  the  Kuhn- 
Tucker  condition,  the  sign  patterns  of  V(An  —  A„_i)  yield  all  the  possible  local  minimizers. 
Then  we  use  the  ratio  inequalities  from  the  previous  chapter  to  eliminate  the  unsymmetric 
minimizers.  Finally  we  prove  J(wj)  indeed  minimizes  An  -  A„_j  locally.  We  sketch  the 
whole  process  and  state  the  main  theorem  in  section  4.5. 

1.2  Review  of  the  Literature 

The  gap  of  a  matrix  indicates  whether  its  spectrum  is  well  separated.  Hence  the 
separation  of  eigenvalues  is  closely  related  to  the  difficulty  of  eigenvalue  computation.  In 
fact  the  number  of  iterations  of  many  methods  depends  on  the  size  of  the  gap. 

Wilkinson  [15,  p.  308]  studied 

W2m+1  =  J2m+i(a)  with  a{k)  =  |m  +  1  -  k\  for  k  =  1, 2, . .  .,2m  +  1. 

The  largest  two  eigenvalues  of  W2m+J  differ  by  roughly  (m!)-2.  For  example  A20  and  A21 
of  Wn  agree  for  their  first  fifteen  decimal  digits!  Nevertheless  this  is  not  the  smallest  gap 
for  such  matrices. 

Wang  [14]  generalizes  Wilkinson’s  matrix  W2m+1  to 

W2m+i(<f)  =  Jim+ii*')  with  a'(k)  =  |m  +  1  -  fc|  d  for  k  =  1,2, . .  .,2m  +  1. 

Then  he  gives  upper  and  lower  bounds  for  its  gap. 

Theorem  1.1  (Wang)  For  d  >  0.92  , 

1  .....  ,  ...  30m2 

2m(m!)2d2m_1  <  ^  2”,+i(<*))  <  (m!)2^"*"1  ‘ 
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Sun  [12,  Theorem  1]  considers  real  tridiagonal  matrix 


Tn  =  tridiag 


bi  bi 

a\  a2 

C  2  C3 


bn- 1 


with  bk-iCk  >  0  for  k  =  2, 3,  • • ,  n. 


and  gives  a  lower  bound  for  gap. 
Theorem  1.2  (Sun)  For  n  >  2, 


ff(Tn)  >  2  (  n  bk-ick)$  p2~n  where 

k=  2 


p  =  min{pi,max(p2,p3)},  Pi  =^n-  An 

1  n  1  " 

Pi  =  - — -(nl„  -J2Qk),  P3  =  - — ^(Y]ak-  «Ai), 

n  -  2  “  n-2  r! 


fc=i 


=  max  {a*  +  |6fe|  +  |ck|}, 

\<k<n 

and 


fc= l 

Ai  =  ,min  -  1**1  -  lc*l}> 

l<fc<n 


(1.2) 


ci  =  bn  =  0. 

We  mention  that  Aj  and  A„  given  in  his  paper  are  misprints.  Ai  should  be  a  lower 
bound  for  the  smallest  eigenvalues  of  T„  and  am  upper  bound  for  the  largest  one.  They 
can  be  estimated  by  the  Gershgorin’s  Disk  Theorem.  For  example  W2m+i  has 

Tn  =  m  +  1  and  A1  =  -2. 


While  the  deleted  absolute  row  sums  are  used  in  Theorem  1.2,  we  may  have  following 
different  choices  by  columns 

\  W  +  l6*-il  +  kt+ilL  Ai  =  nun  {afc  -  |6fc-il  -  |ck+i|}, 

l<Kn  l<Kn 

with  b0  —  cn+i  =0. 

Note  that  Theorem  1.2  can  be  improved  and  the  refinement  is  stated  in  Theo¬ 
rem  4.3.  Table  1.1  compares  the  true  value  amd  several  lower  bounds  from  Theorem  1.1-3 
and  4.3  for  the  gap  of  W2 j.  In  fact  Sun  uses 

MW2m+J)  -  A„_!(W2m+1)  >  2p2-"  =  2  ( 

instead  of  Theorem  1.2  to  get  the  bound  1.06  x  10-15  in  the  table. 

Sun’s  [12,  Theorem  2]  for  gap  is  wrongly  stated  since  he  mistakes  |2x'(A,)/x"(A,)| 
for  |Aj+i  —  A, |,  where  x(^)  =  det(\I  —  T)  and  eigenvalues  {Ai}  of  T  are  in  increasing  order. 
The  true  formula  is 

X"(M  _  v-'  1 

2x'(A.)  '  A, -A  / 
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true  gap 

Theorem  1.1 

Theorem  1.2 

Theorem  1.3 

Theorem  4.3 

7  x  10" 14 

3.8  x  10~15 

1.06  X  10-15 

1.07  x  10“15 

1.24  X  10“i5 

Table  1.1:  Low  bounds  for  <?(W21). 

The  minimum  eigenvalue  separation  of  two  matrice®  is  defined  as  following.  Let 
{A,}  and  {pj}  be  eigenvalues  of  matrices  A  and  B  respectively,  where  A  and  B  may  have 
different  dimensions.  Then  for  all  pairs  of  A,  and 

g(A,B) min|Ai  -  p, | • 

Recall  Ax  and  An  given  in  Theorem  1.2. 

Theorem  1.3  (Sun  [12])  The  tridiagonal  matrices  T„  and  Tn_j  given  by  (1.2)  have  sep¬ 
aration 

g{Tn-i,Tn)  >  bk-iCk  ■  maxfp3"2",  p(n)1-np(rc  -  l)2'"}, 

k=2 

where  Pi  =  A„  -  \x,  p2  =  j~^[(2n  -  1)A„  -  tr(Tn)  -  tr(rn_! )], 
h  =  +  tr(Tn_!)  -  (2n  -  1)A1],  p  =  min{p!,  max(p2,  p3)}, 

p(j)  =  min{pi,  max(p2(j),  PaO))},  p?(j)  =  pr[jAn  - 

PiU)  =  jZi [*r(T})  -  jAj],  for  j  =  n,  n  -  1. 

We  remark  that  a  centrosymmetric  matrix,  which  is  symmetric  with  respect  to 
both  long  diagonals,  can  be  split  into  two  smaller  matrices  such  that  each  one  owns  half 
of  the  original  spectrum  [3].  See  section  1.6  for  more  detail.  Therefore  <7(W2])  is  equal  to 
the  gap  between  two  submatrices,  and  we  can  estimate  ff(W2i)  by  Theorem  1.3.  The  lower 
bound  is  listed  on  Table  1.1. 

In  quantum  mechanics,  an  eigenvalue  corresponds  to  the  energy  level  of  a  certain 
state.  A  separation  of  eigenvalues  thus  represents  the  energy  absorbed  (or  emitted)  in  the 
transition  from  one  state  to  another.  A  lot  of  recent  research  by  physicists  has  focussed  on 
the  first  excitation  energy,  or  in  mathematical  terms,  the  separation  between  the  first  two 
eigenvalues.  We  give  a  brief  survey  of  that  literature  here. 
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First  consider  the  Schrodinger  operator  in  general  dimension  n,  It  is  well  known 
that  the  lowest  eigenvalue  of  the  Schrodinger  operator  is  always  non- degenerate.  Hence  it 
makes  sense  to  discuss  the  gap  between  the  first  and  the  second  eigenvalues. 

Let  fl  be  a  smooth  strictly  convex  bounded  domain  in  Rn,  potential  V  :  Cl  — ►  R 
be  a  nonnegative  convex  function,  and  A  be  the  Laplace  operator.  The  eigenvalues  of 

f  -A/  +  V(x)f  =  A/, 

{  /  =  0  on  dft, 


can  be  arranged  in  nondecreasing  order  0  <  Ax  <  A2  <  A3  <  •  •  • .  Singer,  Wang,  Yau,  and 
Yau  [11]  prove  that 


where  d  is  the  diameter  of  fi.  Under  the  same  hypotheses,  Yu  and  Zhong  [16]  improve  the 
estimate  to 

T2 

A2  “  Al  -  &  ' 

Kirsch  and  Simon  [7]  give  a  Comparison  Theorem  for  A2  -  Aj  of  two  different 
Schrodinger  operators  and  use  it  to  find  new  bounds  on  the  lowest  band  in  a  solid.  For 
bounded  potential  V(x),  they  also  give  a  lower  bound  for  A2  -  A!  depending  on  the  geometry 
of  the  set  C  =  {x|  V(x)  <  A2}  and  the  maximum  value  of  |V(x)  -  v\  over  the  convex  hull  of 
C  and  v  G  [ Aj ,  A2] . 

For  the  one  dimensional  case,  the  Schrodinger  operator  becomes  —  ^  +  V'(x)  in 
an  interval.  It  is  also  known  that  all  the  eigenvalues  are  nondegenerate  and  can  be  ordered 
by  Aj  <  A2  <  A3  <  •  •  •.  Hence  each  separation  A;+i  -  A,  never  vanishs. 

Kirsch  and  Simon  [6]  provide  the  same  type  of  lower  bounds  for  A,+1  -  A,  in 
one  dimension  as  in  [7].  Consider  +  Y(x)  on  [a,i>]  with  either  Dirichlet  or  Neumann 
boundary  conditions  at  a  and  b.  Assume  V  6  C°°([a,6])  and 


7,  =  max{  |t/  -  V(x)|  j  :  x  G  (a,  b)  and  u  €  (A„  A,+I )}, 


then 

~  A,  >  7T7?exp[— 7,(6  -  a)]. 

The  exponential  factor  in  such  bounds  are  realized  precisely  in  tunneling  examples. 

For  the  one  dimensional  Schrodinger  operator  with  a  symmetric  single  well  poten¬ 
tial  V  (x)  in  a  interval  of  length  d  and  with  Dirichlet  boundary  conditions,  Ashbaugh  and 
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Benguria  [2]  obtain  the  optimal  lower  bound 

.  ^  3xa 

A2-A1>-5r. 

Equality  holds  if  and  only  if  the  potential  is  constant.  A  single  well  potential  in  [a,  6]  means 
there  is  a  c  €  [a,  6]  such  that  V  is  nonincreasing  for  x  <  c  and  nondecreasing  for  x  >  c. 

The  lower  bound  for  A2  —  Ai  comes  from  a  general  comparison  theorem  which 
is  stated  below.  Consider  two  Schrodinger  operators  ~2?r  +  U(x)  +^(z) 

Dirichlet  boundary  conditions.  Let  {A, (£7)}  and  {A,(V)}  be  their  eigenvalues  respectively. 
Assume  both  U  and  V  are  centrally  symmetric,  and  U  —  V  is  a  single  well  potential,  then 

Aa(£7)  -  Aj(£7)  >  A2(V)  —  A1(V) 


and  the  equality  holds  if  and  only  if  U  —  V  is  constant. 

The  discrete  analogue  of  Schrodinger  operator  is  a  tridiagonal  matrix  of  the  form 
—L  +  D,  where  L  is  the  discrete  laplacian 


L  =  tridiag 


-2 


-2 


1 


1 


-2 


\ 

/ 


and  D  is  a  diagonal  matrix.  Thus  all  the  previous  problems  reduce  to  estimating  the  gap 
between  two  adjacent  eigenvalues  of  tridiagonal  matrices. 

Ashbaugh  and  Benguria  published  the  discrete  version  of  their  results  in  [l].  For 
example  their  Comparison  Theorem  becomes 


A 2(~L  +  D)  -  Ai {-L  +  £>)>  A 2(-L  +  D)-  A x{-L  +  D), 


if  the  diagonals  of  D  and  D  are  centrally  symmetric,  and  that  of  D  -  D  is  symmetric  in¬ 
creasing  from  the  midpoint.  In  Chapter  3,  we  will  provide  a  simple  pioof  of  this  Comparison 
Theorem  using  our  eigenvectors’  ratios. 


1.3  Symmetries 

Only  symmetric  tridiagonal  matrices  are  considered  in  this  paper.  The  eigenvalue 
problem  is  trivial  for  diagonal  and  bidiagonal  matrices.  So  the  simplest  nontrivial  form  is 
the  tridiagonal.  However  there  is  no  loss  of  generality  since  every  symmetric  matrix  is  or¬ 
thogonally  equivalent  to  a  symmetric  tridiagonal  matrix.  In  fact  wre  assume  all  subdiagonal 
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entries  nonzero.  Such  tridiagonal  matrices  are  called  unreduced.  In  case  some  subdiago- 
nal  entries  vanish,  then  the  matrix  becomes  block  diagonal  with  each  block  an  unreduced 
tridiagonal.  We  can  take  care  of  eigenvalues  of  each  unreduced  submatrices  separately. 

The  unreduced  symmetric  tridiagonal  matrices  have  the  form 

01  02  •••  0n-l  \ 

T(  a)  =  tridiag  a(l)  a(2) .  a(n)  , 

\  01  02  0n-\  ) 

where  a  =  (a(l),a(2), . . . ,  a(n))  is  the  diagonal,  and  all  the  0,'s  are  nonzero.  The  tridiagonal 
matrix  T„  in  (1.2)  can  be  symmetrized  to  the  form  T(a)  by  scaling.  More  precisely,  there 
is  a  diagonal  matrix  D  such  that 

T(&)  =  DTnD-\ 

where  a(k)  =  a*  for  k  =  l,2,...,n,  and  Pk  —  ±\/bkCk+i  for  k=l,2, . . . ,  n  —  1. 

Before  formulating  our  minimization  problem,  it  is  necessary  to  understand  the 
behavior  of  eigenvalue  separations.  In  this  section  we  will  accumulate  the  background 
needed.  We  start  with  some  invariant  properties  of  eigenvalues  and  eigenvectors,  and  then 
explore  symmetries  under  which  gap  is  invariant. 

Let  {A,(T)}  and  {sj(T)}  be  the  eigenvalues  and  corresponding  normalized  eigen¬ 
vectors  of  matrix  T.  It’s  well  known  that  an  unreduced  symmetric  tridiagonal  matrix  has 
distinct  real  eigen  values  [9,  7-7-1].  In  this  paper  we  always  order  A,  increasingly,  i.e. 

Ai  <  A2  <  •  •  •  <  A„. 

Define  the  i-th  eigenvalue  separation  to  be 

9i(T)  :=  AA t(T)  =  A ,+1(T)  -  A.-(T), 

where  A  is  the  forward  difference.  Then  the  gap  defined  by  Definition  1  is  equi valent  to 

ff(r)  =  rmn  Oi(T).  (1.3) 

l<i<n— 1 

For  every  eigenvector  sj,  the  first  entry  Sj(l)  and  the  last  one  s,(n)  are  always 
nonzero  [9,  7-9-5].  An  easy  way  to  see  this  is  from  the  three-term  recurrence;  s;  =  0  if 
s,(l)  =  0.  Hence  without  loss  of  generality,  we  can  assume  s,(l)  >  0  for  all  i’s. 
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Theorem  1.4  Let  7  = 

/I 

-1 

1 

\ 

,  /  = 

/ 

1\ 

1 

,  e  = 

/  i\ 

1 

-1 

1 

u 

J 

1 

<1J 

a  =  (a(l),a(2), . .  .,a(n))T  and  cr  be  a  scalar,  then  for  i  =  1,2, . .  .,n, 
t.  Aj(T(a  +  <re))  =  Aj(T(a))  +  a,  and  s\{T(a  +  oe))  =  Si(T(a)), 
ii.  a))  =  — An_,(T(a)),  and  »i(T(-a))  =  7  s*(r(a)), 

*«•  Ai(T(<re  -  a))  =  o  -  An_i(T(a)),  and  sj (T(ere  -  a))  =  7  sj(T(a)), 

*u.  assume  in  addition  (3j  =  y3n_j  /or  j  =  1, 2, . . . ,  n  —  1,  then 
A,(T(7a))  =  Ai(T(a)),  and  Si(T(7a))  =  7  Si(T(a)). 

Proof. 

i.  Since  T(a  +  ere)  =  T(a)  +  <r7, 

A,(T(a  +  <7e))  *  A,(T(a))  +  a 
and  eigenvector  s;(T(a  +  ere))  remains  unchanged. 

ii.  Observe  that  7-1  =  7  and  IT(a)I  =  —T(— a).  Since  eigenvalues  are  ordered  increas¬ 
ingly,  Aj(T(-a))  =  A,(— 7T(a)7)  =  A,(-T(a))  =  -An_.(T(a)).  Also 

Si(T(-a))  =  Si(— T(— a))  =  Si(/r(a)7)  =  /«i(T(a)). 

iii.  Combine  previous  two, 

A,(T(<re  —  a))  =  er  +  A<(r(-a))  =  <r  -  An_,(T(a)) 

and 

«i(T((7e-a))  =  Si(r(-a))  =  7sj(r(a)). 

iv.  By  the  symmetry  of  fl's,  7T(a)7  =  -T(fa).  Since  7_1  =  7, 

A,(T(7a))  =  A,(T(a))  and  Si(T(7a))  =  Si(7r(a)7)  =  7Si(r(a)). 
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□ 

Although  eigenvalues  are  invariant  under  all  similarity  transforms,  not  many  of 
them  preserve  the  tridiagonal  structure.  7  and  7  are  among  such  few  examples.  Notice  that 
vector  7a  is  just  the  reverse  order  of  a. 

Definition  2  Given  vector  &  =  (a(l),a(2),. . .,  a(n))  and  scalar  u. 
i.  We  denote  the  reverse  order  of  a 

a*  :=  7a  =  (a(n),a(n  -  1), .  ..,a(l)). 

it.  We  define  the  dual  of  a.  with  respect  to  u  by 

a'  :=  ue  -  a  =  (w  -  fl(l),w  —  a(2),  —  a(n)). 

The  duality  is  the  combination  of  shift  and  reflection  about  origin.  Of  course  we 
can  combine  different  operations  in  Theorem  1.4  to  get  other  symmetries. 

As  a  simple  consequence  of  Theorem  1.4,  the  gap  is  invariant  under  shift,  reflection 
with  respect  to  origin,  and  duality.  In  addition  if  fij's  axe  symmetric,  then  it’s  invariant 
under  reversing  diagonal. 

Corollary  1.5  Assume  e,  a,  o  are  the  same  as  in  Theorem  1.4,  then  for  i  —  1.2, _ n 

i.  g,{T{ a  +  <re))  =  g,(T{ a)),  and  g{T( a  +  ere))  =  g{T( a)), 

ii.  g,(T(-a))  =  gn-i(T(a)),  and  g(T(-a))  =  g(T(a)), 
tit.  g,{T{ a*))  =  p„_,(T( a)),  and  g{T{ a*))  =  p(T(a)), 

iv.  assume  in  addition  pj  =  Pn-j  for  j  =  1, 2, . . . ,  n  —  1,  then 
<7,(r(a*))  =  gt(T(  a)),  and  g(T(a*))  =  *(r(a)) 

1.4  Normalization  of  the  Problem 

One  certainly  can  try  to  find  a  tridiagonal  matrix  with  gap  smaller  than  that  of 
W2n+i,  but  we  ask  a  more  general  question:  what  is  the  minimal  gap  over  this  type  of 
matrix?  To  make  this  minimization  problem  well  posed,  some  normalization  is  needed.  A 
few  simple  observations  will  help: 
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unreduced  tridiagonal  Our  problem  becomes  trivial  unless  we  exclude  those  matrices 
with  multiple  eigenvalues,  e.g.  the  identity  matrix.  Therefore  we  only  consider  unre¬ 
duced  tridiagonal  matrices  which  always  have  distinct  eigenvalues. 

symmetric  tridiagonal  Also  we  only  deal  with  symmetric  case  so  that  all  the  eigenval¬ 
ues  are  real.  The  gap  certainly  can  be  defined,  in  a  similar  fashion,  for  complex 
eigenvalues.  But  it  is  more  difficult  to  analyze. 

/3jt’s  bounded  below  The  off-diagonal  elements  play  an  important  role  for  the  distribution 
of  eigenvalues.  Consider 


whose  eigenvalues  are  ±«.  Its  gap  2e  has  no  lower  bound  if  we  let  e  — *  0.  Therefore 
we  need  to  fix  a  lower  bound  on  the  off-diagonal  elements. 

scalar  multiplication  Notice  that  the  gap  function  is  homogeneous  under  scalar  multi¬ 
plication,  i.e. 

g{cT)  =  ag{T) 

for  all  scalar  a.  Hence  we  can  fix  all  the  off-diagonal  elements  /?*  >  1,  which  is  just  a 
normalization  constant. 

n=2  The  eigenvalues  of  Jiia)  are 

i[a(l)  +  a(2)  ±  v/(a(l)-a(2))2  +  4  ]. 

Hence  the  gap 

>  2. 

Equality  holds  if  and  only  if  a(l)  =  a(2).  Therefore  there  are  infinitely  many  mini- 
mizers.  From  now  on  only  n  >  2  is  considered. 

diagonal  spread  The  size  of  the  diagonal  spread  is  crucial.  In  the  next  chapter  we  will 
show  that  J( Wi)  has  gap  approximately  2/wn_2,  which  can  be  as  small  as  possible 
when  u>  is  not  bounded.  Hence  the  diagonal  spread  has  to  be  fixed. 

shift  By  the  shift  invariance  Corollary  1.5(i),  we  can  take  min  a(k)  =  0.  Assume  the 

l<fc<n 

diagonal  spread  is  u>,  then  max  a(k)  =  u/.  In  other  words  we  restrict  all  the  diagonal 
a(i)’s  to  lie  in  the  interval  [0,w]  and  in  no  smaller  interval. 
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u  [0, 0.1]  [0.2, 0.5]  [0.6,  1.5]  [1.6, 4]  [4.1,  oo] 

a  w5 _ W4 _ w$ _ W2 _ wx 

Table  1.2:  Minimizers  for  g(J( a))  with  n  =  15. 

n  fixed  The  dimension  n  of  the  matrix  can  not  be  free  either.  The  eigenvalues  of  \  Jn{ 0) 
are  the  roots  of  the  n-th  order  Chebyshev  polynomial  of  second  kind,  i.e.  cos— y  for 
k=l,2,.  ..,n.  Hence 

,  t  „  *  2r  A  .  3ir  .  n  3r2 

g(Jn( 0))  =  2  cos - 2  cos - =  4sm  — - —  sin  — - —  w  - — r 

yv  v  ”  n  +  1  n  +  1  2(n+l)  2(n+l)  (n  +  l)2 

when  n  is  large. 

Now  we  can  formulate  a  well  defined  minimization  problem.  For  n  >  3  and  u>  >  0 
given,  we  seek 

min  </(J(a))  subject  to  0  <  a(i)  <  u>  for  i  =  1, 2, . . . ,  n.  (1.4) 

Parlett  [10]  conjectures  that  J( wj)  minimizes  (1.4)  for  u>  >  4.  But  in  fact  the 
minimizing  matrix  depends  on  the  size  of  the  diagonal  spread  u>.  Let 

wj(w)  :=  (u>, ... ,q;, 0, 0,u;, ..  .,(*>) 

j  1 

for  j  =  1,2,..  .,[§].  Table  1.2  shows  the  extremal  wj(u>)  for  different  size  of  w  when  n  =  15. 
W5  has  smallest  gap  initially,  and  W4  beats  W5  after  a  while,  then  W3  takes  over  and  loses 
to  wj  later.  In  the  end  wj  wins  the  crown  when  u  is  large  enough. 

For  the  general  n,  wj  |j  is  tbe  starting  minimizer  for  tiny  w,  and  then  . . . , 

etc.  For  large  w,  wj  is  the  ultimate  winner,  and  this  will  be  proved  in  Chapter  4.  Therefore 
(1.4)  is  not  a  simple  algebraic  problem. 

Corollary  1.5(iii)  shows  that  J( wj)  and  /(wj*)  have  the  same  gap,  where 

wj*  :=  (0, . . . ,  0,  u;, . . . ,  u>,  0, . . . ,  0) 
i  j 

is  the  dual  of  wj.  Hence  wj*  is  also  a  minimizer  if  wj  is.  Thus  (1.4)  has  at  least  two 
minimizers. 


13 


Our  final  remark  is  that  J2m+i(wi(m))  has  smaller  gap  than  W2m+i.  Indeed 
^2m+i(wi(m)))  w  m22m_1  and  g(W2m+1)  *  (m\)~2 . 

For  example,  5(^2i(wi(10)))  =  2  x  10*19  while  g(W22)  =  7  x  10-14. 

1.5  Gradient  of  p, 

In  this  section  we  fix  all  the  subdiagonal  entries  /?*’ s,  and  consider  A;  and  g,  as  a 
function  of  the  diagonal  a.  It  is  surprising  that  V<7,  =  Vag,  has  a  simple  form.  Recall  that 
is  the  normalized  eigenvector  of  A,,  i.e. 

T(a)si  =  A, Si  (1.5) 

and 

i  lsi  1 12  =  si^”si  =  1  (1*6) 

Definition  3  The  Schur  product  of  vectors  u  and  v  is  an  element-hy- element  multiplication 

uo  v  :=  (u(l)v(l), ti(2)v(2), . . . , u(n)v(n)). 

For  example  consider  e  =  (1,  -1, 1, -1, . . .),  then  in  Section  1.3  /v  =  e  o  v. 
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Figure  1.1:  gx  and  g2  for  J3((4,0,x)). 


Thus  for  i  =  1, 2, . . . ,  n 

VA,  =  sj  o  sj  and  V<7,  =  VA,+1  —  VA,  =  sj+i  o  sj+1  -  Sj  o  sj. 


□ 

By  Definition  1  of  gap,  or  by  (1.3),  we  expect  that  g{T{ a))  fails  to  be  differentiable 
at  certain  values  of  a.  But  restricted  to  each  open  domain  where  g  =  g,  for  some  i,  Vg  is 
just  V<7i.  For  example,  in  Figure  1.1  we  plot  g\  and  g2  for  J3((4,0,x))  with  respect  to  x 
in  [0,3].  It  is  clear  that  g  =  g \  when  x  <  1.3,  and  g  =  g2  when  x  >  1.3.  Indeed  g  is  not 
differentiable  at  the  intersection  x  =  1.3. 

Before  we  carry  on,  we  need  to  clear  up  our  notation.  Consider  a  smooth  function 
h  :  Rn  — *  Rn,  Va  gt(T{h(a)))  represents  the  gradient  of  the  composite  function  of  T(h(&)) 
and  g,  with  respect  to  a,  while  Vijj(T(/i(a)))  means  the  evaluation  of  V<7,  at  T(h( a)).  Note 
that 

VaS,(r(a))  =  V5i(T(a)). 

We  can  get  more  information  on  V^,  by  Corollary  1.5  and  Lemma  1.6.  The 
following  Corollary  says  ^gi(T( a))  is  invariant  under  shift,  always  perpendicular  to  e,  and 
equal  to  the  negative  of  Vjn_,(T(a*)).  Furthermore  if  /Sy’s  are  symmetric,  then  Vg,(T(aR)) 
is  the  reverse  order  of  Vj,(T(a)). 

Corollary  1.7  Assume  e,  a,  a*,  a*  are  the  same  as  in  Theorem  1.4,  Definition  2  respec¬ 
tively.  Then  for  scalars  a,  u  and  i  =  1, 2, . . . ,  n 
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i.  V*(r(a+<re))  =  V5i(r(a)); 
it.  Vgt(T(a))  ±  e; 
m.  V5l(T(a*))  =  — V5n_,(T( a)); 

tv.  assume  in  addition  0:  =  0n-j  for  j  =  1, 2, . . . ,  n  -  1,  then 
Vffl(T(a«))  =  [V<7,(r(a))]«. 

Proof.  Note  that  V<7<  is  a  row  vector,  and  matrix  Dah( a)  is  the  total  derivative  of  h  with 
respect  to  a. 

i.  Va  g,(T( a))  =  Va  g,(T( a  +  oe))  =  [Vgt(T(&  +  ere))]  Da( a  +  <re)  =  V5,(r(a  +  ere)). 

ii.  V<7,(T(a))e  =  [V<7,(r(a  +  ere))]  e  =  £g,(T(a  +  ere))  =  £<7.(T(a))  =  0. 

Here  is  another  proof  using  the  normalization  of  sj’s, 

Vg,(T( a))e  =  £  s.+1(*)2  -  s,(fc)2  =  ||sI+1||2  -  ||Si||2  =  0. 

k=l 

iii.  va  <7„-.(r(a))  =  Va  ffl(T(u,e  -  a))  =  [V*(T(u*  -  a))]  Da(u,e  -  a)  =  -  Vp,(r(a')). 

iv.  Consider  /  in  Theorem  1.4  and  0j  =  0„-j  for  j  =  1, 2, . . . ,  n  —  1,  then 

Va  g,(T(&))  =  Va  gi(T(i a))  =  [Vffl(T(/a))]  Z?„(/a)  =  (%(T(a*))]  /  =  [Vg,(T(aR))}R. 

□ 


1.6  Normalized  Eigenvectors 


Another  surprising  fact  about  tridiagonal  matrices  is  that  we  have  explicit  formulae 
for  their  normalized  eigenvectors  in  terms  of  eigenvalues.  The  following  theorem  was  given 
by  Paige  in  [8],  as  a  corollary  of  Thompson  and  McEnteggert’s  result  on  adjugates  [13].  We 
refer  the  readers  to  [9,  Section  7.9]  for  more  detail. 

Before  stating  the  result,  we  need  the  following  notation 


/ 


T}, k(a)  :=  tridiag 


o{j) 


0j  •••  0k- l  ^ 

.  a(fc) 

0]  • 0k- 1  / 


for  j  <  k. 
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f  det[A/  -  Tjjkl  if  j<fc; 
xjjcW  ■=  s ,  .  . 

1 1,  lf]>k. 

Xjfc(A)  denotes  the  derivative  of  Xj,k( A)  with  respect  to  A  and  v(k)  is  the  k-th  entry  of 
vector  v. 

Theorem  1.8  Let  {A,}  and  {s;}  be  eigenvalues  and  normalized  eigenvectors  o}T\t1x ■  Then 

i.  for  1  <  i  <  n  and  1  <  j  <  k  <  n 

Xi,nM*iU)s'(k)  =  xij-i(*i)0j---Pk-i  x*+i.»(A<); 

ii.  when  A ,  is  a  simple  eigenvalue  (or  T\<n  unreduced), 

SiU)2  =  XiJ-i(A,)xJ+i.n(A.)/xi.„(A,). 


Corollary  1.9  For  i  =  l,2,...,n 

i.  s,(l)a,(n)x,l  n(A,)  =  0\ 0i  •  •  •  0n— l ! 

ii.  ■s.(l)2X'i,„(A.)  =  X2.n(A.); 

*«•  ■sl(n)2x/j,„(At)  =  Xi,f.-i(A.); 

*«•  Xi.n-i(A,)X2.n(A,)  =  0\0\  ■  --01-1- 

Proof. 

i.  Take  j  =  1  and  k  =  n  in  Theorem  1.8  (i). 

ii.  Take  j  =  k  =  1  in  Theorem  1.8  (ii). 

iii.  Take  j  -  k  =  n  in  Theorem  1.8  (ii). 


iv.  By  (i-iii) 

Xi.n-i(A.)X2,„(Ai)  =  Ml)*(n)xUA.)]2  =  f[  Pi 

k=i 

□ 

A  matrix  A  is  called  centrosymmetric  if  it  is  symmetric  with  respect  to  both  long 
diagonals,  i.e. 


At  =  A  and  }A'l  =  A 
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where  /  is  defined  in  Theorem  1.4.  Another  way  to  characterize  such  a  matrix  A  =  (a,,j)  is 

&i,j  ~  &j,i  =  Qn+l  —  j,n+l—i  =  Qn+l  —  i.n-fl  —  j  for  i,j  —  1,2,  .  .  .  ,  71. 


A  centrosymmetric  matrix  can  be  split  into  two  matrices  of  half  the  size  such  that 
each  owns  half  of  the  original  spectrum.  Half  of  its  eigenvectors  are  symmetric ,  i.e.  v  R  =  v, 
and  the  other  half  are  skew  symmetric ,  i.e.  vR  =  —  v.  Here  is  the  reverse  order  of  v 
defined  in  Definition  2  (i).  We  refer  the  readers  to  Cantoni  and  Butler’s  complete  survey 
on  centrosymmetric  matrices  [3]. 

Centrosymmetric  tridiagonal  matrices  have  the  form 


l  01 


T„  =  tridiag 


«(1) 


V  01 


02  ■■■  02 

a(2)  .  a(  2) 

02  ■  02 


01  \ 

a(  1) 

01 


Let  {A,}  be  the  eigenvalues  of  Tn  in  ascending  order  and  {sj}  be  the  corresponding  normal¬ 
ized  eigenvectors  with  s,(l)  >  0.  Then  sj’s  are  alternately  symmetric  and  skew  symmetric 
provided  A,’s  are  distinct.  A  complete  picture  of  their  behavior  is  stated  below. 


Even  Case  Suppose  n  =  2m.  Define  two  matrices 

(  0\  •••  0m- 2 


T*  =  tridiag 


0m- 1 


a(  1) 


a(m  -  1) 


a(m)  ±  0, 


V 


\ 


/ 


01  '  '  '  0m  —  2  0m  —  1 

Let  {/if}  be  the  eigenvalues  of  T±  in  increasing  order  and  {u^}  be  the  normalized 
eigenvectors  with  uf(l)  >  0.  If  A,’s  are  distinct  and  0m  >  0,  then 

A2fc-x=M£\  »2k-l  =  •^[uk,-(ui)/i], 

=  »2k  =  ^|luk’(uk)H] 

for  k  =  1, 2, . . . ,  m.  The  situation  is  reversed  if  0m  <  0. 

Odd  Case  Suppose  n  =  2m  +  1.  Define 


TZ  =  tridiag 


f  01  0m- 1  } 

a(l) .  a(m) 

\  01  0  m  — l  ) 
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^  01  •  •  •  /?m-l  V20m  N 

^m+i  =  tridia-g  a(l) .  a(m)  a(m  +  1)  . 

V  01  •••  /?m_!  v^m  / 

Similarly  define  {/xjji  (u?}i  and  {u|}  35  above.  If  A,’s  are  distinct,  then 

^2k  =  Mfc,  s2k  =  for  k  = 


and  A2fc_i  = 

s2k-l  =  ^=(Ufc(l),...,ttI(m),V^u[(m  +  l),«l(m) . u[(l)) 

for  A:  =  1, 2, . . . ,  m  +  1. 

Notice  that  W2fn+1  is  centrosymmetric  and  the  eigenvalues  of  W£  interlace  those 


°fHr^i+1.  Hence 


S(^m+i)=3(C,lVl+1), 


which  can  be  estimated  by  Theorem  1.3. 


Corollary  1.10  For  a  centrosymmetric  tridiagonal  matrix  T  with  positive  subdiagonals, 
i.  st(k)  =  (-l)"-'s,(n  +  1  -  k); 

»•  3i(j)2  =  X 1  .J  —  1  ( -^«  )2  0]  ’  '  -0n-j  /  !xi.n(^*')l- 

n  ^  (A,-a(i))jnn  x 

Hi.  5.(1)2  =  -  and  s,(  2)2  =  - * - . 

I"I  |A,-A„|  |A,-A„| 

l  umi 

ti 

for  i,k  =  1,2 . n  and  j  =  1,2,  —  »l  s^i]- 

Proof. 

i.  It  follows  from  previous  results  on  (skew)  symmetry  of  eigenvectors.  It  can  be  proved 
by  Theorem  1.8  as  well.  By  centrosymmetry  of  T , 


Xl.fc-l(^i)  —  Xn+2 -k.n(^i)  X*+J,n(^»)  “  Xl,n-fc(^<)- 
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Hence  |s,(fc)|  =  |s,(n+l  —  fc)|  by  Theorem  1.8  (ii).  Take  k  =  n+l—j  in  Theorem  1.8 

0). 

1  ~  J)  =  -  -  /?«->  Xn+a— i.n(Ai) 

=  Xi  j-i(^t)2  ' '  '@i-\  >  0-  (1*7) 

Then 

n 

3«ffn[3,(j)a,(n  +  1  -  j)]  =  «$n[x'lifl(Ai)]  =  ajffn[JJ(A,  -  A„)]  =  (-1)"-'. 

Um  1 

ii.  Take  absolute  value  of  (1.7)  and  use  (i). 

iii.  Take  j  =  1  and  j  —  2  in  (ii). 

□ 
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Chapter  2 

Extremal  Matrix 


In  this  chapter  we  devote  ourselves  to  the  extremal  matrix 


/ 


=  tridiag 


UJ 


1  •••  1  1  \ 

0 . 0  u 

1  •••  1  1  / 


We  will  discuss  its  eigenvalues,  eigenvectors,  and  gap  in  full  detail.  It  deserves  our  attention 
because  <7n-i  (*fn(wi ))  minimizes  our  objective  (1.4)  for  large  enough  u.  But  more  important 
is  that  we  use  this  gap  to  eliminate  the  other  separations  and  solve  (1.4). 


2.1  Eigenvalues  and  Eigenvectors 

Suppose 

•Jn(wi)v(A)  =  Av(A) 

with  v(A)  =  (^i(A),i/2(A),...,i/„(A)),  then 

(<*>  -  A)»/i(A)  +  ui{\)  =0, 

Vk-iW  -  Ai/*(A)  +  i/fc+i(A)  =0,  k  =  2,3,...,n  -  1, 

*/n-i(A)  +  (u>  —  X)vn(X)  =0.  (2.1) 

If  we  set  t/o(A)  =  w  and  ^i(A)  =  1,  then  we  can  extend  the  three-term  recurrence 

i/t+1(A)  =  A^fc(A)  -  i/fc_i(A).  (2.2) 

to  k  =  1,2, ...,n  -  1. 


Recall  that  the  Chebyshev  polynomials  of  second  kind  satisfy  the  same  recurrence 
relation.  To  be  more  precise,  let 


•in(co*-1  x)  '  1  -  ’ 

•inh(t  co«h_l  r)  i_i  ^ 

rinhfcorii-l  »)  I1'  >  1; 


for  k  =  0, 1, 2,  —  Then 


Uk+i(z)  =  2xUk(x)  —  Uk-i(x), 


for  k  =  0, 1, 2, . . with  U-X(x)  =  0,  Uq(x )  =  1,  Ux(x)  =  2x,  etc.  Observe  that 

Uo(\)  =  Ux(  £)  =  A,  U2(±),  U3(±),  ..., 


U-i(±)  =  0,  «7o(|)  =  1,  . 

are  two  linearly  independent  solutions  of  (2.2),  so  the  sequence  {i/j(A)|t  =  0, 1, . . .,  n}  can 
be  expressed  as  their  linear  combination.  Indeed 

«W*)=  U^-uUk.j^)  for  fc  —  0,  l,...,n  —  1.  (2.4) 

Notice  that  the  Chebyshev  polynomials  of  first  kind  T*(x)  obey  (2.3)  too,  but  have 
different  initial  conditions 

7b(x)  =  1,  Ti(x)  =  x,  T2(x)  =  2x2  -  1.  etc. 


Therefore  we  can  have  different  expressions  for  v*,  for  example 
■W-M  =  + 

4w  —  2A  A  4  —  2Ao>  _  A 

-  4r*rI«5)+ -nr “-«(*> 

for  k  =  0, 1, . . . ,  n  —  1.  As  a  matter  of  fact,  we  can  solve  the  difference  equation(2.2)  directly. 
( aje  two  linearly  independent  solutions  if  A  ^  ±2.  Thus 

...  t  \\  _  \i. .  ■  —  Au>  . .  A  —  4  k  \  .  2  —  Au;  A  —  t/A2  —  4  fc 


*<*>-i<"+7Sr7X 
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Cleaxly  (2.4)  is  the  simplest  form.  Notice  that  we  have  not  used  (2.1)  yet,  which 
gives  us  the  eigenvalue  equation.  Hence  we  can  solve  for  all  the  eigenvalues,  and  then  find 
all  the  eigenvectors  by  (2.4).  For  centrosymmetric  Jn(wj),  however,  the  spectrum  can  be 
split  by  the  technique  in  Section  1.6.  Instead  of  original  matrix,  we  then  deal  with  two 
matrices  of  half  the  size. 

We  first  locate  the  eigenvalue  {A,}  of  J„(wi)  by  Gershgorin’s  Disk  Theorem,  which 
says  Xi  €  [—2,2]  U  [a;  -  l,w  +  1]  for  all  t’s.  For  an  irreducible  matrix,  a  boundary  point 
of  the  Gershgorin  disks  can  be  an  eigenvalue  only  if  every  Gershgorin  circle  lies  on  it  [5, 
Theorem  6.2.26].  Thus  when  ui  >  3, 

-  2  <  Ax  <  •  •  •  <  A„_2  <  2  <  u  -  1  <  An_x  <  An  <  u  +  1.  (2.5) 

We  assume  this  case  in  the  sequel. 


Even  Case  Suppose  n  =  2m  and 

/I  1  •••  1  1  \ 

J*  :=  tridiag  u  0 . 0  ±1  , 

V  1  1  •••  1  1 

then  {A2jk_i}jJ'_1  constitutes  the  spectrum  of  J~  and  {A2*}t=1  constitutes  that  of  J+  while 

u(A,)  =  (^i(A,),^2(A,  ),...,  i/m(A,)) 

is  the  corresponding  eigenvector. 

Using  the  last  equation  in  system  7*u(A)  =  Au(A),  we  have 


—  l ( X)  +  (±1  -  A)i/m(A)  =  0, 
ii'm(A)  =  Xi/m(X)  —  i/m_i(A)  =  t/m+x(A), 

1(5)  .  Um(±)  -  wEf — ,(i), 

±  ^-1(5)  -  (2.6) 

by  (2.2)  and  (2.4).  As  expected,  the  eigenvalue  A  is  a  zero  of  a  polynomial  of  degree  m.  In 
fact  (2.6)  is  nothing  but  the  characteristic  equation  det[XI  -  J*]  =  0. 

Suppose  A  is  An_x  or  A„,  then  y  >  1  by  (2.5).  So  we  can  change  variable  y  = 
cosh  <f>,  where  0  ^  0,  in  (2.6)  to  get 


±£m— i(cosh0)-  Um(cosh0)  =  u/[±£/m_j(cosh0)  -  Um_x(cosh0)], 
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±  sinh  m«p  —  sinh(m  +  l)d>  =  u[±  s inh(m  —  l)d>  -  sinh  m<f>], 
f  (+)  :  —2  sinh  ^  cosli(m  +  5)^  =  -2wsinh  ^  cosh(m  -  ^)<£, 

\  (-)  :  -2sinh(m  +  ^)<f>  cosh  \  =  -2u/sinh(m  -  \)(j>  cosh  §, 

{(+) :  cosh(m  +  \)<f>  =  u>  cosh(m  - 
(-) :  sinh(m  +  \)4>  —  wsinh(m  -  \)<j>. 

Again  consider  t  =  e^,  then  t  >  0,  t  ^  1, 

cosh  l<(>  =  +  e-t4>)  =  i(t'  +  i), 


sinh^=i(e^-e-^)  =  i(^-i). 

Hence  (2.7)  can  be  further  simplified  to 

tm+$-  4.  f-(">+^r)  *2m+l  +  1  t2m  _  *2m-l  _| - t+l 


(+)  :  u>  = 


(-)  :  u>  = 


t2m  +  t  <2m-l  _  t2m-2  _| - t2  +  t' 

tm+i  -  t-(m+V)  t2m+1  —  1  t2m  +  t2m_1  + - f-  t  +  1 


tm~%  -  H"—?)  t2m  -  t  f2"»-l  +  t2m-2  +  .  .  ,+  t2  +  t  V_'  ' 

When  A  is  one  of  A2,...,An_2,  y  €  (-1,1)  by  (2.5).  We  can  change  variable 
A  =  2  cos  if  =  z  4-  where  z  =  e**  and  if  €  (0,  x).  Notice  that  sin  if  ^  0  for  all-in  (0.  r), 
thus  z  is  not  a  real  number.  The  same  computation  shows  that  z  satisfies  (2.8)  or  (2.9)  as 


Both  (2.8)  and  (2.9)  have  2m  roots.  Replacing  t  by  j-  in  (2.8)  and  (2.9),  we  get 
the  same  equations.  Therefore  their  roots  come  in  pairs,  i.e.  t  and  A.  However  we  recover 
the  same  eigenvalue  A  from  both  t  and  A. 

Hence  (2.8)  has  a  pair  of  positive  roots  corresponding  to  An,  and  the  other  zeros 
on  unit  circle  corresponding  to  A2,  A4, . . .,  A„_2.  Similarly  two  positive  zeros  of  (2.9)  give 
A„_i  and  the  other  complex  roots  give  Aj,  A3, . . A„_3. 

Odd  Case  Suppose  n  =  2m  4-  1.  The 


:=  tridiag 


has  eigenvalues  {A2*}£Lj,  which  satisfy 


— l(A)  ^t/m('^)  —  0- 


Thus  by  (2.2)  and  (2.4) 
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Um(±)-uUm-l(±)  =  vm+l{\)  =  0. 

Change  variable 

A  =  2  cosh  <j>  =  e*  +  e~*  =  t  +  A,  if  A  >  1; 

A  =  2cos<,p  =  e* *  +  e~‘v  =  t  +  if  |A(  <  1; 


! 


(2.10) 


then 


u) 


(2.11) 


_  ^m(-y)  -  f2m+2  -  1  _  t2m  +  t2m-2  +  •  •  •  +  t2  +  1 
~  Um- i(y )  ~  <2m+1  -  *  -  t2m_1  +  <2m~3  +  •  •  •  +  t3  +  t ' 

(2.11)  has  2m  zeros  which  are  in  pairs.  An_j  comes  from  the  only  pair  of  positive 
roots,  and  A2,  A*, . . An_3  come  from  the  other  m  —  1  pairs  of  complex  roots. 

In  the  other  hand 


JL+i  •=  tridiag 


V 


1  1 

u>  0 
1  1 


1  V2  \ 

■  0  0 
1  V2  ) 


has  eigenvalues  -(A2jt— 1 and  eigenvectors 

(^(Aafc-i),. .  .,t'm(A2fc_i),-^t'm+i(A2t-i)). 

The  eigenvalue  equation  is 


V2i/m(X)  -  X—=i/m+i(X)  =  0. 


Thus  by  (2.2)  and  (2.4) 


^mfA)  —  Al'nj+^A)  —  Urn{  A)  —  t'nj+2(A), 

crm.,(j)  -wVm.,(±)  =  trm+1(|) 

<■>  l«W  5)  -  )]  =  tW.(f )  -  ), 

Using  (2.10),  we  have 

w  =  Um+lib-Um-lib  =  =  t 2"+2  +  1 

Um($)  ~  Um-2(j)  t"  +  *'m  t2m+1+f 

(2.12)  has  2m  +  2  zeros  and  they  comes  in  pairs  as  well.  The  only  positive  pair  gives  A„, 
and  the  other  m  complex  pairs  give  Aj ,  A3, . . . ,  A„_2. 
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We  summarize  as  follows.  Let  r  and  r'  be  the  smallest  positive  roots  of  p(t)  =  u 
and  q(t)  =  u  respectively,  where 


p(t)  := 


tn+1  +  1 
tn  +  t 


(2.13) 


and 


g(t)  := 


tn+l  -  1 
tn  -  t  ' 


(2.14) 


Then 


A„  =  r  +  i  and  An_!  =  r'  4-  (2.15) 

Since  p(t)  and  q(t)  are  negative  in  (— oo,0),  decrease  from  +oo  to  1  in  (0,1),  and  increase 
from  1  to  4-00  in  [1, 4-oo),  each  function  hits  u  (>  3)  exactly  twice.  In  fact  r  and  A  are  the 
only  real  roots  of  p(t)  =  u> ,  and  so  are  r'  and  p  of  q(t)  =  u. 


2.2  Asymptotic  Expansions 

In  this  section  we  compute  the  asympto.. ;  expansions  of  A„  and  An_!.  The  tool 
is  the  Hensel  iteration  in  the  power  series  domain  R[[x]].  Let  f(y)  be  a  polynomial  with 
coefficients  in  R[[x]]  and  yo  in  R  be  an  0{x)  approximation  to  a  root  of  /(y),  i.e.  y  —  yo 
is  a  solution  to  f(y)  =  0  when  x  =  0.  Suppose  /'(yo)  :=  $£(yo)  #  0  when  x  =  0,  then  the 
sequence  of  iterations  defined  by 

yfc  :=  yjc-i  -  (mod  xk+l)  k=  1,2,...  (2.16) 

has  the  property  that  yt  is  an  0(xfc+1)  approximation  to  y.  This  is  the  Hensel  iteration, 
which  converges  linearly. 

Under  the  same  hypotheses  we  have  the  well  known  Newton  iteration 

Vk  :=  Vk-i  -  y7Q~_^  (modi2*)  k=  1,2,..., 

which  has  the  property  that  yk  is  am  0( x2*)  approximation  to  y.  The  quadratically  conver¬ 
gent  Newton  iteration  is  superior  to  the  Hensel  iteration  in  general.  However  each  step  in 
Hensel  iteration  is  less  expensive  since  no  recomputation  of  /'  is  required.  Therefore  Hensel 
iteration  seems  cheaper  if  only  a  few  terms  in  power  series  are  needed.  This  is  the  reason 
why  we  use  (2.16)  in  the  following  computation. 
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We  are  ready  to  find  the  asymptotic  expansion  of  A„(w).  First  we  convert  An  to  a 
power  series  in  e  near  t  =  0  by  setting  Jj.  Recall  A„  comes  from  p(t)  -  w  =  0,  which 
now  becomes 

tn+\  +  1  j 

tn  +  t  ~7  ~ 

Therefore  we  consider 

/(*)  S  f(in+1  +  1)  -  (tn  + 1). 

and  its  smallest  positive  root  r.  We  apply  the  Hensel  iteration  on  /  to  expand  r  as  a  power 
series  of  e. 

To  obtain  the  initial  value  to,  consider  e  =  0.  Then  f(t)  =  — +  1)  =  0,  and 
to  =  0  will  be  the  appropriate  guess  for  the  smallest  positive  root  of  /.  By  (2.16) 

'• =  ,0  ~  7m  (mod  f!)  =  '• 

Since 

f(h)  =  e((n+1  +  1)  -  (en  +  e)  =  e"+2  -  e”, 

1 1  is  an  d(e")  approximation  to  r,  which  is  far  more  promising  than  t0.  Hence  it  is  better 
to  start  the  Hensel  iteration  from  t\  so  that  each  iteration  improves  n  —  1  terms.  More 
precisely  let  t2,  be  the  successive  approximations  by  (2.16),  then 

At*  =  -  (mod  e1+("-lH*+1)),  (2.17) 

for  k  >  1,  where  A  is  the  forward  difference. 

Observe  that 

f'(t')  =  f(n  +  l)t?  -  nt?"1  -  1  =  -[1  +  nen"1  -  (n  +  l)en+1], 

and 

y~j  =  1  -  [nen~1  -  (»  +  lK*+1]  +  [ns"-1  -  (n  +  l)e"+1]2  +  •  •  -  =  1  +  O (e"'1). 

Thus 

A<r  =  /(ti)((l  +  O^"-1))  =  («n+2  -  en)(l  +  0(<"-1))  s  — <n  +  e"+2  (mode2"-1), 

t2  =  fi  +  Ati  -  (  -  e"  +  fn+2, 

f(t2)  —  €  (f2+1  +  1)  ~  (ij  fj)  =  ( [(^1  Atj )n+1  +  l]  —  [(tj  4-  Ati )n  +  (ti  4-  Afi )] 
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=  «[i?+1  +(n  +  l)t?Ata  +  +  +  0(e4n-3)  +  1] 

-[t?  +  nt^Ati  +  Q  t?~2  At?  +  0(€4"'3)  +  tx  +  Ati] 

=  «(t?+1  +  1)  -  (tj  +  ti)  +  t^AtiKn  -H  1)«  -  n] 

+|tJ-JA*f[(n  +  l)€f!  -  (n  -  1)]  -  A^  +  0(e4"-3) 

=  f<S)  ~  At!  +  en-\-tn  +  <"+2)[(n  +  l)*2  -  n] 

+|e"-2(_e"  +  €"+2)2[(„  +  1)fJ  _  (n  _  i)]  +  o(«4"-3) 

=  nf2"-1  -  (2n  +  l)^"4-1  +  (n  +  l)*2”*3  -  ^.-.^c3"-2 

,  "C3”-1)^  _  n(3n  +  l)^n+2  +  n(n+l)^  +  0(e4n-3)> 

2  2  2 

Af2  =  =  [nf2"-1  -  (2n  +  l)*2"4-1  +  (n  +  1 )e2n+3  +  CXe3"-2)]  [1  +  CXe""1 )] 

=  „f2n_1  -  (2 n  +  l)*2"4-1  +  (n  +  ljf2’*4’3  (mod  e3""2), 

f3  =  t2  +  At2  =  f  -  +  fn+2  + ne2n-1  -  (2ti  + l)^"4-1 +(n  +  l)^"4-3, 

/ (*3 )  =  t  (*3+1  +  1)  —  (t3  +  t3)  s=  f  [(t2  +  At2)n+1  +  1]  —  [(t2  +  At2)n  4-  (t2  +  At2)] 

=  e(t?+1  +  1)  -  (tj  +  t2)  +  tJ-lAt2((n  +  l)ef2  -  n]  -  At2  +  0(e5"~4) 

=  /(<2)  -  Af2  +  (<  -  «"  +  en+2)"-1[ne2n-1  -  (2n  +  l)e2"+1  +  (n  +  l)e2n+3] 

•[-n  +  (n  +  lKe2  -  en+1  +  e"+3)]  +  0(e5"-4) 

=  n(n~  H63n-2  +  n(3n-l)c3„  _  "(Sn  +  l)^,  +  »(n+  1)  j„+4  +  0(f4n_3) 
2  2  2  2 
-n2f3"-2  +  n(3n  4-  2)«3"  -  (n  4-  l)(3n  4-  l)*3"4-2  +  (n4  l)2e3n+4  4-  0(e4n“3) 
_  n(3n  -  l)c3n_2  3n(3n  4-  1)^,,  _  (3 n  4-  l)(3n  4-  2) c3n+2 

+  (n  4- 1X3714-2)^  +  0(€4n-3)j 

A<3  =  ~m)"f{t3)  [1+°(f,,"1)] 

—  "(3 n  -  l)f3n.2  |  3n(3n  4-  1)  j,  (3fi  4-  l)(3n  4-  2)  3w4., 

+(n41^n42)<3„+4  (mod  (4n_3) , 

U  =  f  —  4-  «"+2  4-  nf2""1  -  (2n  4-  l)f2n+1  4-  (n  4-  l)f2n+3  -  — 

3n(3n  4-  1)  _3n  _  (3n  +  l)(3n  +  2)^,+2  (n  +  l)(3n  +  2)  3n+4 
2  2  2' 
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Since  U  is  an  0(e4n_3)  approximation  to  the  root  r, 

r  =  U  +  0(e4"-3). 

Recall  that  An  =  r  +  A,  so  the  power  series  of  A  is  needed  as  well.  Notice  that  A  is 
in  fact  the  largest  root  of  /.  Hence  we  can  repeat  Hensel  iteration  with  appropriate  initial 
guess  to  expand  A.  However  it  is  easier  to  invert  it  directly 

i  =  [t,  +  0(e4"-3)]-1 

=  j[l  -  en_1  +  en+1  +  ne2n~2  -  (2n  +  l)e2n  +  (n  +  l)e2n+2 
_n(3n_-^)<3w-3  + 

=  l{l  +  [fn-l  _  fn+l  _  nf2n-2  +  (2„  +  1)f2n  _  („  +  1)e2n+2  +  ?.(3n  ~  Hg3n-3j 


+[<— 1  -  en+1  -  ne2"-2]2  +  e3""3  +  O (e3"-1)} 


=  -{1  +  en_1  -  en+1  -  (n  -  l)e2n-2  +  (2n  -  l)e2n  -  ne2n+2 

+  (n~.1K3,!l.Z-2jf3n-3  +  0(f3n-l)} 


=  A  +  en-2  -  en  -  (n  -  l)e2"~3  +  (2n  -  lje2"-1  -  ne 
+  (n~  l)(3n-  2) <3n.4  +  0(f3n-2); 

l  _ 


2n+l 


(2.18) 


by  the  formula  of  geometric  series  yAy  =  1  +  r  +  r2  +  •  •  •.  Finally  by  (2.15)  we  have 

A„  =  t  +  - 
T 

=  j  +  e  +  e"-2  -  2e"  +  en+2  -  (n  -  l)e2n_3  +  (3n  -  l)e2n-a 

— (3n  +  l)e2n+1  +  (n  +  l)e2n+3  +  (n  ~  1K3n  ~  2)e3n-4  +  0(f3n_2) 

=  {u  ,  ^ ,  ^~h)2  *z±  i  3n~1  3n  +  1  |  "+1 

ijj  3  /j2n-l  ^i2n+l 


(n-l)(3n-2) 


(2.19) 


Similarly  we  can  get  the  asymptotic  expansion  for  A„_i(u;).  This  time  we  consider 
the  smallest  root  t'  of  function 


h(t)  =  e(tn+1  -  1)  -  t"  +  t. 
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The  initial  guess  t'Q  =  0  and  t\  =  <  are  the  same.  Then 

At;  =  en  -  fn+2, 

At;  =  ne2"-1  -  (2n  +  l)«2n+1  +  (n  +  l)*2n+3, 


At,  =  n(3n- 1)  ^  _  3n(3n+l)^„  (3n  +  l)(3n  +  2)  ,3n+2  _  (n  +  l)(3n  +  2)f 

3  2  2  2  2 


3n+4 


by  Hensel  iteration.  It  follows  that 

r'  -  f  +  f"_f"+2  +  ne2n-i  _(2n  +  l)e2n+1+(n  +  l)e2n+3  +  ^^^€3n_2  +  0(e3n)  (2.20) 
and 

1  =  l-fn-2  +  .n_(«_1)f2n-3  +  (2n_1)f2n-1_ng2n+l-(^-jK3n-_2)g3n-4  +  C)(f3n-2) 
T  6  ^ 


(2.21) 


Therefore  by  (2.15) 


A„_i  =  — 


=  i  +  e  -  <”-2  +  2fn  -  fn+2  -  (n  -  l)e2n-3  +  (3n  -  lje2”'1 


-  (3n  +  l)e2n+1  +  (n  +  l)e2n+3  -  — — — — e3"-4  +  0(^n~2) 

* 

=  (L,  |  *)  n-l  3n-l  3-+1  »+l 

K  u  Un  uj2n-3  T  w2n+l  T  w2n+3 

(n^l)(3n  -  2)  +  0, 


2urJr’“4 


(2.22) 


We  used  the  modulo  ei+("-i)(*+i)  representation  in  the  iteration  (2.17).  It  seems 
that  we  implicitly  assume  n  +  2  <  2n  -  1  in  Afi  and  2n  +  3  <  3n  -  2  in  A t2,  etc.  However 
these  assumptions  are  not  required  for  a  legitimate  iteration.  The  reason  is  that  the  extra 
error  terms  won’t  affect  the  order  of  approximation.  Similarly  for  (2.18)  to  (2.22),  several 
terms  overlap  to  each  other  when  2n+3  >  3n  — 4,  but  the  validity  of  the  expansions  remains. 
We  conclude  that  (2.19)  and  (2.22)  hold  for  n  >  2  and 


Theorem  2.1  For  n  >  2,  the  matrix  J„(w i)  has  two  dominant  eigenvalues  that  satisfy 
V  x  2(u>-£)2  (n  —  l)(3n  —  2)  1 


>  2(w  —  —  )2/u;" 


as  tjj 


00. 
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Let’s  look  at  two  cases  when  n  is  small.  We  can  compute  the  eigenvalues  directly 
from  the  characteristic  polynomial.  Indeed 


^(wi) 

has  eigenvalues  u  ±  1  and 

Aj  —  Aj  =  2  = 

J3(wi)  has  eigenvalues 


■c :) 


2(u )  -  -M2  4  1  . 

- - 2^  +  —  +0  —  • 


Aj  =  ^-(uj  —  y/ui2  +  8),  A2  =  lj,  and  A3  =  ^(lj  +  y/u2  +  8) 


2 

with 


•A3  — -Aj  —  —  ( \/u)2  +  8  —  lj) 

= 

lj.  4  8  ,32  1 

—  X  [ — 7 - 7  +  — 7  +  0(  —j)] 


2  u>  : 

2  4  16  1  . 

- 3  +  —J  +  0(—s). 

LJ  UT  UT  U>‘ 

u;J  lj°  lj1 


(2.23) 


2.3  Bounds  for  An  -  An_! 

Although  (2.19),  (2.22)  and  Theorem  2.1  are  asymptotic,  there  are  precise  bounds 
for  An,  An_i  and  An  —  An_j.  We  need  the  following  lemma,  whose  assertions  are  suggested 
by  the  expansions  of  (2.18)  and  (2.21). 

Lemma  2.2  i.  Suppose  n  >  3  and  w  >  3,  then  u  +  <  \  <  lj  + 

**"•  Suppose  n  >  4  and  lj  >  3,  then  lj  -  <  p  <  lj  -  . 

Proof. 

i.  Recall  that  \  is  the  largest  zero  of  /(t)  or  p(t)  -  lj,  where  p  is  defined  by  (2.13).  For 
lj  >  1,  equation  p{t)  =  lj  has  only  two  real  roots,  i.e.  r  e  (0,1)  and  \  6  (l,oo). 
Hence  it  suffices  to  show 

2  1 

P^+3^7)  <u><p(u;+— ), 


31 


then  by  Intermediate  Value  Theorem 


<  -  <  u  + 


Take  rj  =  u  +  ^2_r  for  short,  then 

p(T})  <  u  <=> 


T)n+1  +  1  <  +  UT) 

(t)  -  u>)r)n  <  ur)  -  1 

221  <  w(w  +  -2_.)  _  i 

1  wn-1 

2(")"  <  W  +  — 2y  —  — 
w  w"  1  a; 

2  12 
2(1  +  —r)n  <  w - 1 — —r 


(2.24) 


Since  un/n  increases  with  n  and  for  u>  >  2,  and  33/3  =  9  >  8, 

un  >  8n  if  n,  u;  >  3. 

Also  notice  that  the  sequence  (1  +  ^)fc  is  strictly  increasing  with  limit  e.  Hence 

(1  +  ~)k  <  e  for  k  >  0. 
k 

The  inequality  (2.24)  is  always  true  since 

2(1  +  —)n  <  2(1  +  2-)n  <24/e<3-i<w-l  +  -2— 

v  wn  v  8n  3  w  u.’n"1 

Similarly  for  the  other  inequality,  substitute  £  =  u  +  ,  then 


p(0  >  u 


({  -u)C>ut-  1 
•2—  >  u(u  +  u>2~")  -  1 

(i)n  >  1  +w1_n  -  w'2. 

u 


The  last  inequality  is  always  true  since 


(1  +  w1"")"  >  1  +  nw1'"  +  •  •  •  >  1  +  nw1*"  >  1  +  w1-n  -  w-2. 


ii.  Recall  that  q  is  defined  by  (2.14)  and  for  u  >  1  equation  g(f)  =  u  has  only  two  real 
roots,  i.e.  t'  €  (0,1)  and  $7  €  (l,oo).  Let  =  u>  -  then 


9(0  <  «*> 


(On+I-i<«(rr-«r 
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<**  - 1  <  (w  -  W)n 

«T 


ul  -  u3~n  -  1  < 


n-2 


1-u1-"  -U~2  <  (¥- )" . 

u 


(2.25) 

We  need  following  inequalities,  which  can  be  proved  by  calculus  or  by  induction, 

(1  -  £)"  >  1  -  nS  for  n  >  1  and  0  <  6  <  1, 

un~3  +  1  >  n  if  n  >  4  and  u>  >  3. 

Indeed  (2.25)  holds  since 

(1  -  u>1~n)n  >  1  -  nu1-"  >  1  -  (un~3  +  l)u'-n  =  1  -  cJ1_n  -  uT2. 

For  the  other  inequality,  consider  rj'  =  u  —  tj2~t  i  then 

q(r]')  >  u  <=>  un i  -  1  >  (w  -  T]')(rj')n 

2  x  -  2(r/,)n 

^  ~  3^)  " 1  >  -^rrr 

<=*>  w  —  ~  r  >  2(— -)". 

u;  u;"-l  v/,j' 


u; 


Obviously  the  last  inequality  is  true  since 

3  ~  33 


u  -  -  -  — -—r  >3-i-^>2>2(l-  — )". 
w  w*»-l  3  /.}" 


We  remark  that  Lemma  2.2  (ii)  fails  when  n  =  3,  since 

_1_  1 _ 1 _ _ i_ 

T1  u}  2 

by  (2.21).  Now  the  following  theorem  can  be  derived  from  Lemma  2.2. 

Theorem  2.3  Let  An  and  An_i  be  the  dominant  eigenvalues  of  J„(wi)- 
Assume  w  >  3,  then 

i.  for  n>  2,  w  +  w+4J,5_n  +  <  An  <  w  +  w+3i;Un  +  j^rr/ 


ti.  /or  n  >  4,  u>  +  ^rsbi-S  ~  <  *n-i  <  w  +  - 


U)n— 1  * 


ui.  /or  n  >  3, 

^  r .  in  —  l' 


U)n 


■  u;n(i-wJ-2n; 


<  A„- An_x  < 
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Proof. 

i.  Assume  n  >  3,  then  by  Lemma  2.2  (i) 


1  1 
- n —  <  r  <  - — - — 

u>  +  u2~n  u  +  2o;1_n 


12  1  11 

u)  -l  .  4-  . .  <  r  4-  —  <  u>  4- - - +  . 

Ijj  4-  Ul2~n  U)n~ 3  T  U  4  ~n  (jjn  —  2 

The  desired  inequality  follows  from  (2.15).  It  is  trivial  for  the  case  n  =  2  since 

1  2  ,  1 
u>  4- - —  4-  —  <An  =  u;  +  l<u;  +  — — — -  4-  1. 

u>  +  1  u)  u)  +  2 a;-1 


ii.  Similarly  by  Lemma  2.2  (ii) 


- — —  <  t'  <  - - — 

u  -  2u>1~n  o>  —  u>2~n 


u  —  2w1_n  un~2 
iii.  According  to  (i)  and  (ii) 

A„  —  ^n— 1  ^ 


lx  /l 

— r  <  An_i  =  r  +  —  <  a;  + 


—  u2~n  lJn-l  ’ 


wn-l 

4 

a;  +  a;2-"  u 
2 

tjn-l 

a;n(l  —  a;2-2") 

4 

2a;  2 

yn-1 

a;"  —  a;"-1  ’ 

1  —  2  <  w 

!-2n  k  J 

~  1 

if  n  >  4,  since 


Similarly 

_A  2  1 _ 1 _ 2 _ 4 _  2 

”  n  UJn~ 2  UJ  +  2w1_n  LJ  ?'•  ’3~n  ^jn~ 2  n -f  1  ( 1  4^. — 2 n  )  ^n— 2 

The  case  n  =  3  comes  from  straight  computation.  In  view  of  (2.23),  we  want 

4  2  1  /-r — -  .  2  4 

~ 2 - tt; - 77  <  ;  VW2  +  8  ”U>)  < - — — - — — 

a;2  o;3(l  -  a;-4)  2  a;  w4(l-4w6) 

8  4  y-r - -  4  8 

<=>  w  H - - - r - -  <  VU2  +  8<W+ - - - - - r 

U)*  —  u>” 1  U; 

—  ®1  16  16  8a;  64 

a;4  (or3  —  a;-1)2  a;  a;3  —  a>_1  a;2 (a;3  —  a;-1) 

16  64  16a;  64 

<  a;2  (a;4  —  4a;-2)2  a;4  —  4a;-2  a;(a;4  —  4a;-2) 


(2.26) 
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The  first  inequality  in  (2.26)  holds  since 

2 


2  8 
+  -  <  77  + 


w*  i)2  w  34  (33  -l)2  3 

and  the  second  one  holds  since 

02  4  1.1 


+  1  <  1. 


024  —  4 u2-2  u(uA  —  4o2-2 )  u>2  02 


_ _ _  _  r - —  + 

—  4o2-2l  u;2  w  -  4o2-5 


2/1  _4,  J_ 

w3  _  ^-3  J  <  u/2  (2  +  26'  <  w2' 


2.4  Dual  matrix  J„( wi*) 

The  dual  matrix 


*^n(wl’)  =  tridiag 


'  1  1  •••  1  1  \ 

0  u . 02  0 

V  i  i  •••  i  i  ) 

shares  the  same  gap  with  Jn(wi ).  Therefore  Jn(w i")  is  an  extremal  matrix.  However 
all  of  its  eigenvalues  can  be  recovered  from  those  of  Jn(wi).  Indeed  by  Theorem  1.4  (iii) 

Wn(wx’))  =  w  -  An— <( J«(wi))  for  t  =  1,2, . . .,  n.  (2.27) 

In  view  of  (2.19),  (2.22)  and  (2.27),  J„(wi*)  has 


'M 

and 

^2 


1 

_  ii. 

-  A-)2 

CJ  ' 

n  —  1 

371-1 

3n  +  1 

1 

71+1 

(n-  l)(3n-  2) 

u> 

U2n 

+  ^n-3 

022""1 

T  U)2n+l 

(*;2n+3 

2o23"-4 

1 

,  (w 

-  -M2 
02  / 

n  —  1 

3n  -  1 

3n+  1 

n  +  1 

(n-l)(3n-2) 

02 

+  — 

02" 

+  u,2n-3 

w2n-l 

w2n+l 

(*>20+3 

2023"--* 

+  0{- 


1 


,.3n  — 2 


+  0(- 


1 


.3n— 2 


asymptotically.  Similarly  we  have  counter  parts  of  Theorem  2.1  and  2.3. 

Corollary  2.4  Let  X\  and  A2  be  two  smallest  eigenvalues  of  Jn( wj*),  then 
i.forn>  2,  A2  -  At  =  4-  1  +  as  u>^cc; 

it.  for  n  >2  and  w  >  3,  -J+lbi"  "  <  At  <  -j^rr  -  u+v*-”  • 

iii.  for  n  >  4  and  u  >  3,  +  uhr  <  A2  < 

iv.  for  n  >  3  and  u  >  3, 

2  _ 4 _ 2 _ ^  .  <  _2 _ 1 _  <  2 

wn-l  <  wn-l  u/n(l  —  U2~2n)  2  1  02n_2  U2n+1(l  —  4o2_2n)  u>n~2 
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Chapter  3 

Ratios  of  Eigenvectors 


Important  to  our  minimization  problem  are  the  results  on  ratios  of  entries  of 
eigenvectors,  the  subject  of  this  chapter.  We  consider  unreduced  symmetric  tridiagonal 
matrices 

/  01  •••  0n-\  \ 


r„(a)  =  tridiag 


a(l) 


a(n) 


\  0\  0n~  1  ) 

with  the  diagonal  a  =  (a(l),a(2),...,a(n)),  and  all  the  Pi's  positive. 

Assume  T( a)  has  eigenvalues  Ai  <  A2  <  •  •  •  <  An  .  Let  v;  =  (v,(l),  i>,(2), . . . ,  r,(n)) 
be  an  eigenvector  corresponding  to  A,,  and  Sj  =  (s,(l),  s;(2), . . .  ,s,(n))  be  the  normalized 
eigenvector.  Notice  that  vi(i),  v„(i),  v,(l)  and  v,(n)  axe  nonzero  for  all  i’s  [9,  7-9-5].  Hence 
we  assume  all  v,(  l)’s  are  positive  in  the  whole  chapter,  though  the  theorems  apply  to  general 
r,(l)’s. 


For  i  =  1, 2, . . . ,  n,  we  can  write  T(a)vj  =  A^vj  as  a  system  of  linear  equations 


Pk-iv,(k  -  1)  +  (a(k)  -  A i)Vi(.b)  +  Pk Vi{k  +  1)  =  0, 


(3.1) 


where  v,(0)  =  u,(n  +  1)  =  0.  From  (3.1)  there  cannot  be  two  consecutive  zeros  in  an 
eigenvector.  If  w,(fc)  ^  0,  then 


Pk- 


v‘(k  -1)  ,  a  Vi{k+1) 


1  ®i(*) 


+  0k- 


Vi(k) 


=  A,  -  a(k) 


This  ratio  equation  is  the  key  tool  that  leads  to  all  the  theorems. 


(3.2) 
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3.1  Ratio  of  vn_i  and  vn 


(3.2)  governs  the  ratios  of  consecutive  entries  of  an  eigenvector.  It’s  not  surprising 
that  our  first  lemma  uncovers  the  relation  between  the  ratios  of  adjacent  entries  of  v;  and 
those  of  vj. 


Lemma  3.1  Assume  j  >  i  and  Vi(k)vj(k)  >  0  for  k  =  1,2, •••,/,  then 


Pj(fc) 

Vj(k  -  1)  Vi(k  -  1)' 


for  k  =  2,3, . ..,f  +  1. 


(3.3) 


Proof.  Recall  that  X,  >  Aj.  For  k  =  2, 

»i(2)  _  Xj  -  a(l)  Xj  -  a(l)  _  t>,-(2) 
Uj(l)  01  01  Vi(l) 

by  (3.2).  Suppose  (3.3)  holds  for  some  k  <  £,  then 

_  ~  1)  Vi(k-  1) 

Vj(k)  v,(k) 


(3.4) 


since  both  ratios  in  (3.3)  axe  positive  by  assumption.  In  view  of  (3.2), 

Vj(k  —  1)  t?,(fc+l)  1)  v,(k  +  1)  _ 

"  Vj(k)  +0k  Vj(k)  ~  >-«(*)>  A«  -“(*)“  0*-i  Vi(k)  r.(fc)  ’ 


Multiply  (3.4)  by  0k~\  and  add  it  to  (3.5),  then 


Vj(k  +  1)  Vj(k  +  1) 
Vj(k)  vi(k) 


as  desired.  By  induction  (3.3)  holds  whenever  Vi(k)vj(k)  >  0. 


□ 


If  we  have  Vi(k)vj(k)  <  0  in  Lemma  3.1,  then  the  inequality  (3.3)  reverses.  Also  it 
is  possible  to  replace  k  =  1, 2,  •  •  • ,/  by  k  =  l V  +  1, . . . ,  n  in  Lemma  3.1.  Here  comes  our 
main  theorem,  which  states  that  the  element-by-element  ratio  of  vn_i  and  vn  is  strictly 
moi.otonic. 


Theorem  3.2  Under  the  normalization  r„_i(l)rn(l)  >  0,  then  is  a  strictly  de¬ 

creasing  sequence  in  k. 
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Proof.  Without  loss  of  generality,  we  may  assume  v„(l)  and  v„_i(l)  are  positive.  Then 
v„(k)  >  0  for  all  k’s  since  vn  has  no  sign  change.  Because  vn_i  has  exactly  one  sign 
reversal,  there  is  an  integer  l  <  n  —  1  such  that 


Un— l(l)  ^  0)  •  ■  • » ^n— l(f  —  1)  > 


o,  Vn-i(f)  >  o,  +  1)  <  0, . . . ,  V„_i(n)  <  0. 


Take  i  =  n  —  1  and  j  =  n  in  Lemma  3.1  to  find 

Vn-l(k) 

Vn(k-  1)  -1)’ 

Vn-l(k  -  1)  ttn  — 1  (k) 

vn(k  -  1)  vn(k) 

for  k  =  2, 3, . . . ,  i.  Hence 


ttn-l(l)  «n-l(2)  ^  ,  Vn-l(f  +  1) 

«„(1)  un(2)  vn(l)  -  n„(f+l)  • 


(3.6) 


The  rest  of  inequalities  can  be  proved  by  the  symmetry  of  the  eigenvectors.  Recall 
/  from  Theorem  1.4,  then  T  :=  JT(a)/  just  reverses  the  diagonals.  By  Theorem  1.4,  T  has 
the  same  eigenvalues  {A,},  but  eigenvectors  {vj*}  in  reverse  order,  i.e. 


»*(*)  =  Vi(n  + 

1  - 

r-4 

II 

• 

<2 

j? 

2,.. 

• ,  n. 

(3.7) 

Notice  that  v^(k)  > 

0  for  all  k,  and 

V* 

.t(k)  <  0  for  k  = 

1,2, 

...,n  - 

£.  Apply 

(3.6)  to 

eigenvectors  and 

~ vn_l ’ tien 

tftt) 

vn  (2) 

■  > 

.^(n-f)  >0 

r«(n  - 1) 

>  - 

Vn-l(" 

t’«(n  - 

-1+  1) 
-£+  1)  ' 

Therefore  by  (3.7) 

fn-ljn)  Vn— t(n-  1)  Vn-l(f-i-l)  t>n— l(l) 

Un(n)  «n(n  -  1)  vn{(  +1)  “  v„(f)  ’ 

which  concludes  the  proof.  □ 


We  remark  that  is  strictly  increasing  in  k  when  v„_i(l)v„(l)  <  0.  Needless 

to  say,  the  normalized  eigenvectors  sj’s  obey  Lemma  3.1  and  Theorem  3.2.  However  we 
obtain  more  by  using  the  normalization  of  sj’s. 


Corollary  3.3  For  n  >  3,  either  |sn_i(l)|  >  jsn(l)|  or  Isn-^n)!  >  |s„(n)|. 
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Proof.  Suppose  sn(l),  ^n-i(l)  >  0  and  the  assertion  is  false,  i.e. 


a„(l)  >  an-i(l)  and  -  a„_i(n)  <  $„(n). 


Thus  by  Theorem  3.2 


4  ^  sn-l(l)  _  ^n— 1  (2)  ^  \  , 

Hence  for  all  k's 

1  *  9jrn£  or  M*)l  * 

with  at  least  one  strict  inequality.  Then  we  have  a  contradiction 

1  =  J2  3n(fc)2  >  •Sn-lW2  =  1- 
fc=l  k= 1 


By  Corollary  3.3  and  ] |sn| ]§  =  l!sn-ll|2  =  there  exists  an  integer  k  such  that 
|s„(A:)|  >  |sn_i(A;)|.  More  precisely  we  have 


Corollary  3.4  Suppose  n  >  3  and  sn_i(f)s„_i(£  +  1)  <  0  for  some  integer  l,  then  either 

M*)|  >  |a„-i(0|  or  | sn(£  +  1)|  >  | *._,(<  +  1)|. 

Proof.  Again  assume  s„(l),  s„_:t(l)  >  0,  s„_i(f)  >  s„(t)  and  s„(f  +  1)  <  +  !)• 

Then  by  Theorem  3.2 

■Sn-l(l)  3n-l(l)  >  i  ^  1(^  .  3n— 1  (rt) 

5„(1)  sn(£)  -  -  sn(f+l)  s„(n) 

i.e.  |s„_i(fc)|  >  \sn(k)\  for  k  =  1,2, ...,n 

with  at  least  one  strict  inequality.  Therefore 

1  =  £  s„_i(fc)2  >  ]T  s„(fc)2  =  1. 
fc=i  fc= i 

which  is  a  contradiction.  Hence  sn-i(k )  >  s„(k)  for  k  =  l  or  l  +  1.  □ 


We  remark  that  Corollary  3.3  and  Corollary  3.4  may  fail  for  the  case  n  =  2.  For 
example  Ji( 0)  has  eigenvalues  ±1  and  corresponding  eigenvectors  ^75)  and  (^-,  ^»). 
This  is  a  counterexample. 

To  illustrate  Theorem  3.2  and  Corollary  3.3-4,  consider  Jio((2, 0, . . . ,  0, 2)),  which 
has  ratio  vector  S9(k)/sio(k) 

(1.0098, 1.0010, 0.9579, 0.7953, 0.3416,  -0.3416,  -0.7953,  -0.9579,  - 1.0010,  - 1.0098), 


Figure  3.1  gives  another  example. 


Figure  3.1:  of  /^((0, 0,3, 0,0, 3, 0,0, 0,3)). 


3.2  Sign  Pattern  of  V<7„_i 

Recall  that  the  separation  j„_i(T(a))  :=  A„(T(a))  —  An_i(T(a))  is  a  real  valued 
multi- variable  function.  Its  gradient  is  given  by  Lemma  1.6 

V^CTXa))  =  (s„(l)2  -  3n_x(l)2,sn(2)i  -  s„_i(2)V..,s„(n)a  -s^n)2).  (3.8) 

As  a  consequence  of  Corollary  3.3  or  3.4,  V<7„_i  ^  0  for  n  >  3  and  all  a’s.  This  certainly 
is  not  true  for  n  =  2,  ^(0)  in  the  previous  section  is  an  example. 

Assume  n  >  3  and  sn(l),  Sn-^l)  >  0  in  the  sequel.  According  to  Corollary  3.3, 
there  are  three  cases 

I.  s„_!(l)  >  s„(l)  and  -s„_i(n)  <  s„(n); 

n.  s„-i(l)  <  Sn(l)  and  -s„_i(n)  >  s„( n); 

ID.  s„-i(l)  >  s„(  1)  and  -Sn-i(n)  >  s„(n). 

Combined  with  Theorem  3.2,  each  case  gives 


T 

•"S 

•n— l(»  +  l 

!>...> 

•n-l(n) 

i* 

*n(«)  >  1  ^ 

#n(«+l J 

»n(n) 

^  11 

TT 

•n-lU)  ^ 

,  , , ,  s 

*n-l  (n). 

u. 

1  ^  *n(l )  ' 

^  —  1  / 

•nM  > 

»«(")  ’ 

ITT 

>  »n-l(0 

v. 

1  n.  *n-l(*'+l) 

\  ,  s 

•»-i(;-i) 

>  —1  ' 

ui. 

1  -  *»(l+l) 

•n{j~  1) 

^  1  . 

>  nr 

for  some  integers  i  and  j.  Therefore  the  gradient  (3.8)  has  corresponding  sign  patterns 
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I.  .,^—^5+0,+, ..  .,-f,+0); 

i-th 

H.  (+0, +0,^-^, 

j-th 

in.  j>  +o,  +,•••,+,  +o,^— j . 

i-th  j-th 

Here  u+ 0”  means  the  entry  can  be  positive  or  zero. 

Recall  from  the  end  of  last  section,  the  ratio  vector  sg(k) / si0(k)  of 
*^io((2,0, ...,0,2))  is  skew  symmetric.  This  is  true  for  all  centrosymmetric  tridiagonal 
T(a)  since  its  eigenvectors  are  either  symmetric  or  skew  symmetric  by  Corollary  1.10  (i). 
Therefore  we  can  replace  ”or”  by  "and”  in  Corollary  3.3  and  3.4. 

Corollary  3.5  Assume  T(a)  is  centrosymmetric  and  n  >  3,  then 

i.  |s„-i(l)|  >  |sn(l)|  and  |sn_!(7i)|  >  |s„(n)|; 

ii.  \sn(k)\  >  |sn_x(*)|  for  k  s  [f ],..., [2f2]. 

Only  case  (III)  above  is  possible  for  centrosymmetric  T.  The  gradient  Vgn_j  is 
also  symmetric  about  the  center  and  has  the  sign  pattern 

^9n-i  =  (~i  ••  +0>+>  •••>+>  +0,  ^  (3.9) 

»-th  (n+l  — «)-th 

Now  we  are  ready  to  prove  Ashbaugh  and  Benguria’s  Comparison  Theorem  [1, 
Theorem  5.1].  They  only  prove  it  for  J( a),  but  in  fact  it  is  true  for  all  centrosymmetric 
T(  a). 

Definition  4  A  vector  u  =  (ti(l),u(2),. . .,  u(n))  is  symmetric  increasing  (from  the  center ) 

*/ 

»(!)  =  «(»)  >  «(2)  =  «(»  -  1)  >  •  •  •  >  «([^±1))  =  «(I§)  +  1). 

Theorem  3.6  (Ashbaugh  and  Benguria)  Assume  T( a)  is  centrosymmetric  and  u  is 
symmetric  increasing,  then 


gn-i(T(*  +  u))  <  gn-\(T{ a)). 

Equality  holds  if  and  only  if  all  the  entries  of  u  one  the  same. 


(3.10) 
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Proof.  The  case  n  =  2  is  nothing  but  a  shift  (or  translation)  of  T,  so  equality  always 
holds.  Assume  n  >  3.  By  the  Mean  Value  Theorem,  there  exits  r?  €  (0, 1)  such  that 

5„_i(T(a  +  u))  -  0n_i(T(a))  =  V^„_|(T( a  +  7711))  •  u.  (3.11) 

By  assumption  a  and  u  are  centrally  symmetric,  then  so  is  a  +  qu.  Thus  T( a  +  7711))  is  also 
centrosymmetric  and  we  can  assume  V^n_:(r(a  +  tj u))  has  sign  pattern  (3.9)  for  some  i. 
Now  we  translate  u  to 


u'  :=  u  -  u(t  4-  l)e, 

where  e  =  (1, 1, ... ,  1).  Since  u'(k)  =  u(k)  -  u(i  +  1)  for  k  =  1,2, . .  .,n,  u*  has  the  sign 
pattern 

u'  =  (+0,...,^  0,-0, ...,-0,0,  +0  ,...,+0).  (3.12) 

fth  (n+l— »)-th 

by  the  symmetric  increase  of  u.  Again  ”+0”  means  the  entry  >  0  and  ”-0”  means  <  0. 
Match  the  patterns  of  (3.9)  and  (3.12)  to  get 

Vff„_i(r(a  +  7711))  •  u'  <  0,  (3.13) 

Hence 

Vsn-i(r(a  +  77U))  •  u  =  V<7„_j(T(a  +  77U))  ■  u'  +  u(i  +  l)V^„_j(r(a  +  77U))  •  e  <  0 

since  the  last  term  vanishes  by  Corollary  1.7  (ii).  Then  (3.10)  follows  by  (3.11). 

If  u  =  <7e  for  some  scalar  <7.  then  (3.10)  becomes  equality  by  Corollary  1.5  (i).  For 
another  direction,  suppose  we  have  equality  in  (3.10),  then  so  is  (3.13) 

n 

Yl(ku'(k)  =  0  (3.14) 

fc= 1 

where  V^n_j(T(a  +  77U))  =  (£j,&» . . Since  (3.14)  has  no  positive  term,  fou'(Jt) 
vanishes  for  all  fc’s.  Notice  that  ft  <  0  and  fjn+i  j  >  0  by  (3.8)  and  Corollary  3.5.  Therefore 

u'(l)  =  «'([^])  =  0  or  «(1)  =  «(«  +  1)  =  u([~^])- 


By  symmetric  increase  of  u,  all  the  entries  of  u  are  equal. 


□ 
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3.3  Unbalanced  Diagonals 

In  this  section  we  consider  T( a)  with  unbalanced  diagonals.  We  assume  the  right 
side  is  heavier  than  the  left  for  the  main  diagonal  and  two  subdiagonals,  though  the  reverse 
case  can  be  treated  similarly.  The  ratios  of  two  consecutive  entries  of  an  eigenvector  obey 
a  certain  rule,  which  will  be  explored  here.  We  start  with  the  following  lemma. 

Lemma  3.7  Let  vj  be  the  i-th  eigenvector  ofT(a).  Assume 

{t>,(l),  t?,(2), . . . ,  Vi(£)  have  the  same  signs , 
and  so  do  v,(n  +  1  -  t ),  V{(n  +  2  —  £),...,  t»,(n); 

a(k)  <  a(n  4-  1  -  k)  for  k  =  1,2, 

0k  <  0n-k  for  k  =  1.2 . i; 

for  some  l  <  [  j],  and  j  is  the  first  index  k  that  gives  strict  inequality  in  (3.16)  or  (3.11). 


Then 

i. 

Vj(k+1)  _  V, (n— fc) 

v,(k)  i>,(n+l-k) 

for  k  =  l,2,...,min{;  -  l.f}; 

ii. 

^  v.(n-fc) 

Ui(fc)  *  Mn+l-*) 

for  k  =  j,j+  !,...,£. 

We  give  two  remarks  before  the  proof.  In  the  case  j  =  1,  (i)  is  not  necessary  in  Lemma  3.7. 
If  equality  holds  everywhere  in  (3.16)  and  (3.17).  then  we  can  take  j  =  C  +  1  and  part  (ii ) 
is  redundant. 

Proof.  Since  neither  r,(l)  nor  r, ( n )  vanishes, 

Vi(k)  ^  0  for  all  k  =  1, . . . ,  C  and  n  +  1  —  n 

by  (3.15).  Hence  it  is  legitimate  to  take  ratios 

v,(fc  +  l)  _ j  Vi(n-k) 

Vi{k)  Vi(n  +  l-k) 

for  k  =  1,2, Moreover  both  ratios  in  (3.18)  are  positive  for  k 
(3.15). 

i.  Suppose  j  >  1,  then  by  assumption 


(3.18) 

=  1,2 . 1-  1  by 


(3.15) 

(3.16) 

(3.17) 


a(k)  =  a(n  +  1  —  k)  and  0k  =  0n-k  for  k  =  1, 2, . . . ,  j  -  1. 


(3.19) 
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Our  induction  argument  begins  with 


Vj(2)  _  A,  -  a(l)  _  Xj  -  q(t)  _  u,(n  -  1) 
Vi(l)  0\  0n—\  Vi(n) 


by  (3.2)  and  (3.19).  Now  assume 


Vi(k)  _  Vj(n  -  k  +  1) 
Vi(k  -  1)  t),(n  -  k  +  2) 


for  some  k  <  j,  then  by  (3.19) 

*  v.(*-l)_o  Vi{n-k  +  2) 

0k- 1 - 7TT—  =  Pn+\-k—, - 7---TT  (3.20) 

Vi(k)  v,(n  -  k  +  1) 

Use  (3.2)  and  (3.19)  again  to  get 

«.(*)  «,(*:) 

X  ^  „  Vi(n-k)  v,(n  +  2  -  k) 

—  —  <*(n  +  1  —  *)  —  Pn-k — ; — - rr+Pn+l-fc — ; - 77  •  (3-21) 

t’,(n  +  1  -  k)  Vi(n  +  l-k) 

Subtracting  (3.20J  from  (3.21),  we  have 

vt(k  +  1)  _  t>,(n  -  k ) 

v.(*)  ~Vi(n-k  +  l) 

by  (3.19).  Part  (i)  follows  by  induction  until  k  =  j  or  there  is  a  sign  change. 


ii.  Suppose  j  <  l.  From  (i) 


VtU)  _  v,(n  +  1  -  j) 

v,(j  ~  1)  v,{n  +  2  -j)' 


a  v>U~l)  *  v,(n  +  2-j) 

v,(j)  v,(n+  1  -  j) 


(3.22) 


According  to  (3.2)  and  (3.16) 

4  ViU  “  „  ViU  +1)  x 

Pj- 1 - 7TT -  +  0j - 777 -  =  A,  -  a(j) 

Vi(j)  Vi{j) 

>-  +  <323> 
Subtract  (3.22)  from  (3.23),  then 


a  v«(>  ±  ])  ^  4  Pj(n  -  j) 

3  ViU)  -^~3Vi(n  +  l-jy 


(3.24) 


i>  JL. 

0}  0n-j 


By  (3.17) 


(3.25) 


44 


Since 


a(j)  <  a(n  +  1  -  j)  or  fi}  < 


inequalities  (3.23)  and  (3.24)  are  strict,  or  (3.25)  is  strict.  Multiplying  (3.24)  and 
(3.25),  we  have 

+  1)  >  p»(w  -  j) 

Vi(j)  v,(n  +  1  —  j) 

Assume 

v,(k)  Vi(n  +  l-k) 

Vi(k-l)  >  Vi(n  +  2-k) 
for  some  k  <  l,  then  by  (3.17) 


~  &k-\ 


Vj(k  -  1) 
Vi(k) 


> 


~fin+\-k 


Vi(rt  +  2  -  k) 
Vi(n  +  1  -  k) 


(3.26) 


Similarly  by  (3.2)  and  (3.16) 


*  !)  ,  *  v,(k  +  l)  _  x 

Pk- 1 — —7T\ - ^  Pk—rrr~  ~  A«  ~  a(k) 


Vi(k ) 


=  A,  -  a(n  +  1  -  fc)  >  fin—k 


Vj(n  -  *0 
r,(n  +  1  -  A:) 


+  Pn+l-k 


Vi(n  +  2  -  k) 
Vi(n  +  1  -  k)' 


(3.27) 


Add  both  (3.26)  and  (3.27),  then  multiply  it  with  ^  to  get 


v,(k  +  1)  u,(n  -  k ) 

Vi(k)  Vi(n  -  k  +  1)’ 

Below  we  consider 

a(k)  <  a(n  +  1  -  k)  for  k  =  1, 2, . . .,  [^-J,  (3.2S) 

0k<0n-k  fork  =  1,2 . [j],  (3.29) 

and  apply  Lemma  3.7  to  vn  and  vn_i.  If  at  least  one  of  the  inequalities  in  (3.28)  or  (3.29) 
is  strict,  then  positive  vn  is  heavier  in  the  right  side  and  the  broken  line  graph  of  vn_i 
crosses  zero  after  the  midpoint 


Theorem  3.8  Assume  (8.28)  and  (8.29)  hold,  and  j  is  the  first  k  that  gives  strict  inequality 
in  (8.28)  or  (8.29).  Then 


i. 


ii. 


*>«(*)  Vn(n-fc-fl) 


> 

VfJk) 


vn(n-k) 
un(n-*+l ) 


for  k  =  1,2,...,  j  -  1; 
fork  =  j,j+  1 . [f]; 
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tit.  if  vn  is  positive,  then  for  k  =  1,2,...,  [^] 

v„(k)  <  vn(n  +  1  -  k).  (3.30) 


Proof.  Since  vn  has  no  sign  change,  (3.15)  is  true  for  l  =  [|j.  Now  (i)  and  (ii)  axe 
exactly  Lemma  3.7. 

Consider  (iii).  From  (ii)  with  k  =  [j] 


Thus  for  odd  n 


Pn([f]  +  1)  »n(«  ~  [f]) 

"„([?])  vn(n  +  1  —  [j])  ' 


Vn(^)  Vn(afi) 

«„(^)  vn(*p) 


n  -  1  n  +  3 

Vn(—Z—)  <  u„(— — ), 


and  for  even  n 


+  1)  p«(f) 

*n(?)  «n(f+l) 


/  n .  ,  n  y 

U„(-)  <  vn{-  +  1). 


Hence  (3.30)  holds  for  k  =  (|j.  Now  we  can  prove  this  part  by  induction  backward  on  k. 
Assume 


vn(k  +  1)  <  vn(n  -  k) 


(3.31) 


for  some  k  >  1.  In  view  of  (i-ii),  we  have 

vn{k)  <  v „(n  -  k  +  1) 
vn(k  +  1)  “  v„(n  -  k) 

Multiply  (3.31)  and  (3.32),  then  (3.30)  follows. 


(3.32) 

□ 


In  Theorem  3.8  we  assumed  at  least  one  strict  inequality  in  (3.28)  or  (3.29).  Sup¬ 
pose  they  axe  all  equalities,  then  Theorem  3.8  (i)  holds  for  all  k  =  1,2, . . .,  [|]  and  (ii)  is 
redundant.  As  in  the  proof  of  Theorem  3.8  (iii),  we  can  show 


vn(k)  =  i >„(n  +  1  -  *)  for  k  =  1,2, ...,[|]. 

This  gives  the  symmetry  of  vn  for  centrosymmetric  T,  which  is  not  new  of  course. 

The  next  theorem  considers  vn_i.  Notice  that  Lemma  3.7  is  valid  until  it  encoun¬ 
ters  the  sign  change  of  vn_i.  The  broken  line  graph  of  vector  v  is  a  piecewise  linear  curve 
that  goes  through  all  the  nodes  (k,v(k)). 


Theorem  3.9  Assume  (3.28)  and  (3.29)  with  at  least  one  strict  inequality.  Then 
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i.  the  zero  7  of  the  broken  line  graph  of  vn_j  satisfies  7  >  2^ 

ii.  if  rn_i(l)  is  positive,  then 

t>„_i(2^i)  >  0  for  odd  n,  and  vn_i(f )  +  u„_i(5  +  1)  >  0  for  even  n; 


tt„-l(fc+l)  t'n-l(n-fc) 

«n-l(k)  —  «»_v(Tl-fc+l) 


fork  =  1, 2, . . n  —  [7]. 


Proof.  Without  loss  of  generality,  we  can  assume  un_j(l)  >  0. 
Suppose  7  <  Define 

(  7  —  1,  if  7  is  an  integer; 


J  7  -  1,  if  7  i 
I  [7],  othei 


otherwise. 


Then 


t*n— i(l)  >  0, .  .  . ,  >  0,  +  1)  <  0,un_i(f  +  2)  <  0,.  <  0. 


Hence  by  Lemma  3.7 


1>n-l(f  4-  1)  Pn-l(«  ~  t)  „ 

“  «„-!(/)  -vn.l(n-t+ 1)  ’ 


which  is  impossible. 

Suppose  ^  <  7  <  2^.  When  n  is  odd,  we  have 

/n-1\  „  ,n  +  l.^_  ,  ,n  +  3. 

vn-i(— — )  >  0, u„-i(  ~  )  <  0,  and  rn_!(— — )  <  0- 

Since  there  is  at  least  one  strict  inequality  in  (3.28)  or  (3.29),  by  Lemma  3.7 

0  > 

which  is  a  contradiction.  When  n  is  even,  we  have 

Vn-i(^)  >  0,vn_i(^  +  1)  <  0,  and  r„_i(^)  +  r„_i(^  +  1)  <  0. 

Again  we  have  a  contradiction  by  Lemma  3.7 

un— 1(7  +  1) 

-  r„_,(2)  *.-i(f  +  1)-“ 

Therefore  7  must  be  larger  than  2±1.  Since  the  broken  line  graph  of  vn_1  is 
positive  at  the  midpoint  ,  (ii)  follows.  Part  (iii)  is  exactly  Lemma  3.7.  □ 


Consider  Jio((2, 2, 0,0, 0,0, 0,0, 2.1, 2))  as  an  example.  Its  eigenvectors 
s10  =  (0.0089, 0.0111, 0.0049, 0.0049, 0.0110, 0.0308, 0.0891, 0.2584, 0.7498, 0.6014), 
S9  =  (0.6177, 0.7359, 0.2592, 0.0912, 0.0320, 0.0108, 0.0024,  -0.0031,  -0.0123,  -0.0103) 
illustrate  well  Theorem  3.8  and  Theorem  3.9.  Notice  that  j  =  2  and  7  <  7  <  8. 


47 


3.4  Convexity 

In  this  section  we  consider  T( a)  with  partially  symmetric  diagonals.  We  have 
following  convexity  property  for  the  entries  of  an  eigenvector. 

Theorem  3.10  Assume 

a(k)  =  a(n  +  1  -  k)  and  0k  =  /?„_*>  0  for  k  =  j  +  1,  j  +  2, . . . ,  [-y-]> 

a{k)  +  2/3k  <  K  for  k  =  j  +  l,j  +  2,...,[^y^], 

for  some  i  <  n  and  j  <  f2^-].  Under  the  normalization 

>  0,  when  n  is  odd; 

u«(f )  +  +  1)  >  0,  when  n  is  even; 

we  have 

Pj [viti)  +  ®i(«  +  1  -  j )]  >  P]+i[vi(j  +  1)  +  v,(n  -  j)]  >  ■  ■ 

>  /3mM[™^])  +  t;.([|]  +  l)]  >0.  (3.36) 

Proof.  First  assume  n  is  odd,  then  =  [^]  +  1.  By  (3.1),  (3.33),  (3.34)  and 

(3.35) 

=  (*,  -  +  „,(!§]  + 1)]  >  0. 
The  counter  part  of  even  n  can  be  proved  by  a  similar  fashion: 

Pf-i  -  1)  +  »i(^  +  2)]  =  0n_i  t>,(|  -  1)  +  /3?+lVi(|  +  2) 

=  Mi  -  «( )  -  0?<\(|  + !)!  +  Mi  -<■(£  +  D)».(f  +  1)  -  )] 

>  [*i  -«(£)-  0 }]M§ )  +  *i( J  +  D]  >  0?[»i(§ )  +  V,(|  +  1)]  >  0. 

We  now  use  induction  backward  to  prove  (3.36).  Assume 

&[».•(*)  +  vi(n  +  1  ~  *)]  >  0k+i[vi(k  +  1)  +  v;(n  -  fc)] 


(3.33) 

(3.34) 

(3.35) 
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for  some  fc  >  j ,  then  by  (3.1),  (3.33)  and  (3.34) 

0*-l[v«(fc  -  1)  +  v,(n  +  2  -  fc)]  =  Pk-iv i(k  -  1)  +  0n+i-kv,(n  +  2  -  k)] 

=  [(A*  -  a(k))vi(k)  -  (3kVi{k  +  1)]  +  [(A;  -  a(n  +  1  -  k))v,(n  +  1  -  fc)  -  0n-kVi(n  -  k)) 
=  (Ai  -  a(fc))v,(fc)  +  (A,  -  a(n  +  1  -  fc))«i(n  +  1  -  k)  -  0k[vi(k  +  1)  +  v,(n  -  fc)] 

=  [Ai  -  a(fc)][r,(fc)  +  t>i(n  +  1  -  fc)]  -  /3fc[t>i(fc)  +  t>,(n  +  1  -  fc)] 

=  [A^  -  a(fc)  -  /3fc][vi(fc)  +  t>i(n  +  1  -  fc)]  >  /3t+i[»i(fc  +  1)  +  Vi(n  -  fc)]  >  0. 

as  desired.  ° 

Let’s  apply  Theorem  3.10  to  the  previous  example  Jlo((2,2,0,0,0,0,0,0,2.1,2)). 
We  have  j  =  2  and  eigenvalues 

An  =  3.2467  >  An_!  =  3.1915  >  2. 

Under  the  normalization  (3.35),  Theorem  3.10  says  that 

*;( 2)  +  s,(9)  >  s,(3)  +  3,(8)  >  *(4)  +  s<(7)  >  s,(5)  +  3,(6) 

for  i=n  and  n-1.  The  numerical  values  given  above  satisfy  these  inequalities. 
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Chapter  4 

The  Minimal  Gap 


In  this  chapter  we  settle  our  minimization  problem;  to  find  the  minimal  gap  among 
the  class  of  matrices 

Cl  :=  {J(a)|a€[0,W]n} 
when  u  is  large  enough.  Formally  we  seek 

min{y(  J(a))j  0  <  a(k)  <  u  for  k  =  1, 2, . . n}, 

or  min{<7,(  J(a))|  a  G  [0,w]n  and  t  =  1,2, . . n  —  1}.  (4.1) 

The  answer  to  this  problem  is  stated  precisely  in  Theorem  4.8.  Section  4.5  gives  the  outline 
of  our  proof. 


4.1  Ericsson’s  Lower  Bound 


Consider  the  tridiagoiifl  matrix  T  of  the  form 


tridiag 


0i 


0n-i 


a(  1) 


a(n) 


(4.2) 


\  01  0n—l  ) 

with  ail  0t's  positive.  Let  the  eigenvalues  of  T  be  labeled  as  Aj  <  Aj  <  . . .  <  A„,  and 
d  =  An  -  Aj  be  the  eigenvalue  spread  of  T.  Assume  S]  =  (sj(l),s,(2), . .  ,,s,(n))  is  the  i-th 
normalized  eigenvector  of  T. 


Theorem  4.1  (Ericsson  [4])  g(T)  >  20i0j- ■  ■  0n-i/dn~2 


where  d  =  An  —  A] . 
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Proof.  Let 

X(X)=:det[XI-T)=f[(X~Xk) 

k=  1 

and  x'(A)  be  the  derivative  of  x(A)  with  respect  to  A.  Then 

n— 1 

s,(l)sj(n)x/(Ai)  =  JJ  0k,  for  i  =  1, 2, . . . ,  n.  (4.3) 

i=i 

by  Corollary  1.9  (i).  Since  the  arithmetic  mean  exceeds  the  geometric  mean, 

1  =  I l®i 1 12  =  ^  *»’(1)3  +  ^C")2  >  2  M.(lK(«)|, 

fc=X 

and  (4.3)  gives 

n(A. - At)  n  - a«) = ix'(A*)i  >  2”n  pk.  (4.4) 

t=l  i=i+l  fc= 1 

Pull  out  the  A,+i  —  Xi  term  and  bound  the  others  by  d  in  the  left  hand  side  of  (4.4),  then 

(A,+i  -  A,)  <T~2  >  2  JJ  /?*, 

t=i 

i.e.  Ai+1-Ai>2("n/3fc)/dn-2 

k=\ 

for  i  =  1,  2, . . . ,  n  -  1.  Hence  we  have  the  desired  inequality  by  (1.3).  □ 

Although  the  eigenvalues  of  T  can  be  surprisingly  close  to  each  other  as  in  If  21, 
according  to  Theorem  4.1  they  cannot  be  arbitrarily  close  when  the  entries  a(i)'s  and  3,'s 
are  bounded.  We  remark  that  Theorem  4.1  can  also  be  derived  by  Theorem  1.2,  but  Sun’s 
method  is  different  from  ours.  In  addition  we  need  the  following  refinement. 

Theorem  4.2  Given  1  <  i  <  n  —  1.  Let  m  =  min{i,n  —  t}  and  d  be  the  spread  of  T ,  then 

9i(T)  >  22m~10i02  •  •  •/?„-! /d"-2. 

Proof.  Assume  t  <  f ,  then 

n(A.-A>)  n  (a*-a.) 

j=  1  k=«+i 

=  n(A.-A>)  (A.+i-A.)nff1(Afe-A.)-  n  (a* -a,-) 

J=l  k=«  +  2  k=n-i+2 
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Therefore 


=  £[(*•  -  AiXWi  - ■  (a‘+*  -  a*)  •  "nV*  -  a.) 

)= l  k=i+2 

<  g(  Ai  -  Ai  +  A".^  -*)»  •  (A,+1  -  a.)  <r~* 

i-i  2 

<  (I)2"-’1  (>rt.  -  A.)  J"'2'  =  £  (Art.  -  A.)- 


*>2“VnW-2 

fc=l 


by  (4.4).  The  argument  for  t  >  ^  is  similar.  □ 

This  refined  theorem  says  that  the  middle  eigenvalue  separations  have  bigger  lower 
bounds  than  the  end  ones.  Thus  the  middle  separations  cannot  be  too  close  compared  to 
the  end  ones. 

Now  we  can  get  a  lower  bound  for  the  gaps  in  our  class  ££,.  The  spectrum  of  J( a) 
is  contained  in 

n— 1 

[a(l)  -  1,  a(l)  +  1)  U  y  [a(fc)  -  2,  a(k)  +  2]  U  [a(n)  -  1,  a(n)  +  1] 
k=2 

by  Gershgorin’s  Disk  Theorem.  Since  a(Jfc)  £  [0 ,  w] ,  all  the  eigenvalues  are  in  the  interval 
[~2,u>  +  2]  and  the  spread 


Therefore  by  Theorem  4.1 


d  <  uj  +  4. 


g(J)  ^  d„-i  ^  (w  +  4)«-2 


for  J  €  C-u-  Moreover  by  Theorem  4.2 


(o;  +  4)"-2  (4'5) 

for  n  >  4,  i  —  2, 3, . . . , n  —  2  and  J  6  ££,. 

Recall  that 

?->(J(Wx))  < 

by  Theorem  2.3  (iii).  Hence  the  minimum  in  (4.1)  must  be  smaller  than  — .  Thus  all  the 
middle  gf  s  in  (4.5)  are  too  large  to  consider  provided  that 


7  —  8.,n  2  >  <=>  4  >  (1  +  — )n~2 

(u>  +  4)"~2  un~ 2  us 


n-^  -  1  ' 


O  u  > 


(4.6) 
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The  right  hand  side  of  (4.6)  seems  mysterious.  Consider  6  —  then 
4s  =  es]a4  =  l  +  £ln4  +  ^(£ln4)2  +  .-->  l  +  fln4 

and 

-l6  -  1  <  6Lq4  =  kT^n  “ 2^' 

The  constant  =  2.885390082  <  2.8854.  Therefore  (4.6)  holds  if 

u  >  2.8854(n  -  2).  (4.7) 

By  more  detailed  analysis  of  the  spread  this  constant  can  be  reduced,  but  some  constraint 
like  (4.7)  is  necessary  because  of  the  counterexample  in  Table  1.2. 

We  conclude  that  the  minimum  of  (4.1)  comes  from  g\  or  gn-\  if  w  is  large  enough. 
This  is  certainly  trivial  when  n  =  3.  Recall  that 

=  gn-i(J(*m)) 

by  Corollary  1.5  (iii),  where 

a’  =  we  -  a  =  (u  -  a(l),a;  -  a(2), . . .  ,u>  -  a(n)) 

is  the  dual  of  a  with  respect  to  u.  Note  that  a*  €  [0,  w]"  if  and  only  if  a  €  [0,u/]n.  Therefore 
under  the  condition  of  (4.7),  (4.1)  is  equivalent  to 

min  {  <7n-i(^(a))|  a  e  [0,w]n}.  (4.8) 

After  the  reduction,  it  suffices  to  solve  (4.8). 

4.2  Refinement  Sun’s  Theorem 

In  this  section  we  also  consider  the  tridiagonal  matrices  T(a)  of  the  form  (4.2) 
with  positive  /3*’s.  Sun’s  Theorem  1.2  is  refined  below.  These  inequalities  say  that  A;+1  -  A, 
cannot  be  too  small  when  *  >  f  and  the  trace  of  T  is  large,  or  when  *  <  f  and  the  trace  of 
T  is  small. 

Theorem  4.3  Let  An  be  an  upper  bound  for  X„  and  A x  be  a  lower  bound  for  Ai.  Define 

9  :=  — C  a{k )  -  (n  -  l)Aj],  q  :=  [(n  -  l)An  -  a(k)], 

k=i  n  -  z  fc=1 

*  •=  E  -  (n  -  1)Ai],  r  :=  ((n  -  1)A„  -  a(fc)], 

4=2  4= 2 
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i  ■■=  +  £>(*)-(«-  Di,], 

n~  *  *  k=2 

1  a(l)  +  a(n)  ^5 

*  :=  j  K"  -  X)A"  -  -"'o'  ^  ~  Z  “(*)]. 

g  =  max{?,g},  f  =  max{r,r},  f=max{t,t}. 


i.  9i(T(a))  >  2(  n'flkMqr')*-1,  Si(T( a))  >  2(nA)/H  /or  *  =  1, 2, . . . ,  [a=l]; 

fc=i  fc=i 

«•  *(T(a))  >  2(”n  foVW)*-1,  ff,(T( a))  >  2(' Tf  &)/f"-2  /or  t  =  [Sfl], . . n- 1; 

k-i  k= i 

m.  s(T(a))  >  2(n1/3fc)/(9f)f-1,  and  s(T(a))  >  2("n  /?t)/t»'2. 


Proof.  According  to  Corollary  1.9  (iv) 

Xl,n-l(^«)X2tn(^»)  =  IJ  Pt, 
k= 1 

'where  Xj,k{X)  '•=  det[AJ  -  T,t*)  and 

/  Pj  •••  \ 

Tj.kCa)  :=  tridiag  a(j) .  a(fc)  for  j  <  k, 

\  Pj  ■■■  Pk- i  ) 

Let  {#*;}£-}  and  be  the  eigenvalues  of  and  T^<n  respectively,  in  increasing 

order.  Thus 

n— 1  n-1 

IJ(A'  -  **)(*.•“  Wk)  =  n  for  i  =  l,2,...,n.  (4.9) 

k=i  fc= i 

Replace  i  by  i  +  1  in  (4.9)  and  multiply  it  by  (4.9),  then 


nV.  -  tkX*i+i  -  hx*  -  Mfc)( a,+i  -  fit) = n 

k=i  k= i 


(4.10) 


Recall  that  Xk  <  Ok  <  ^fc+i  and  Xk  <  Hk  <  ^fc+i  for  fc  =  1,2, ...,n  —  1  by  Cauchy’s 
Interlacing  Theorem  [9,  10-1-2).  Since  the  arithmetic  mean  is  no  less  than  the  geometric 
mean,  we  have 

inf*.--**)  ff  m-x,)}^ 

k=i  *=i+i 
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<  :r^[i>i  -  »*)  +  £  («* -  AO] 

n  ~  z  k=\  k=.+l 

=  -i-[(2i-n)Ai-2»t+  2  «t] 

*»  •  i _ «  i _ •  I  i 


~  fc=l  k=.+l 

^s[(2i  -  n)Xj  —  -  2  £  0fc  +  ^  0*],  if  *  <  f 

k=i  k=i 

^j((2»  —  n)Af  +  0,  +  2  5^  0k  —  52  0k]>  if  *  >  f 

k=i+l  k=l 


l&MTu 


,n-i)  -  (n  -  1) Aj]  <  q,  if  »  <  $ 
1)A„  -  <r(Ti,„_i)]  <  g,  if  *  >  f 


n— 1  n— 1 

since  £  0*  =  tr(Ti  „_j)  =  £  <*(*)•  Thus  for  i  <  ?,  (4.10)  gives 
k=x  k= l 

ft/tf  =  (0.  -  A,)(A,+1  -  0, •)(/*.-  -  A,)(A,+i  -  Ml)  • 

k=l 

•  -1 

IK*  —  0fc)(A,+i  —  0k)(A;  —  Atfc)(A,+i  —  Hk)  • 

k=i 

n(fc  -  K){0k  ~  A.+1  )(#**  -  A, •)(/**  -  A.+i) 

k=» 

<  |^*  ~  A«  ^  At>i  —  0j  j2[^«  ~  A»  4-  A,+i  —  fij y fti-i fn-i 

=  (|)y-r*"<. 


i  «-  9i  >  2(  JI  &)/(«r')>  1  for  •  m  1,2,.. .,[2— i], 
fc=  1  1 

It  is  exactly  the  same  when  *  >  f  •  Clearly  gap 

9(T)>2(l[0k)/(qf^-1. 

k=l 

Again  use  the  inequality  of  arithmetic  and  geometric  means 

<  ^(g  +  r)  =  i, 


(g*)’  <  ^(g  +  r)  =  t. 


The  other  half  follows  in  the  same  way. 
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Theorem  4.3  is  slightly  better  than  Theorem  1.2  since 

1  _  "  1  " 

P2  = - -(nAn  -  ^2  at)  >  t  and  P3  =  ~ — r(^a*  ~  «Ai)  >  < 

n  —  2  n  —  2  , , 


Ai  <  min  {a(fc)}  and  An  >  max  (a(fc)}. 

~  *=1.2 . n  *=1,2 . n  1 

For  Wilkinson’s  matrix  W2m+\,  =  ra  +  1  and 

.  „  v  m2  +  2m 


q  =  r  =  t  = 


2m  -  1 


Hence  by  Theorem  4.3 


gn-i(W2m+l)  >  2(-^_L)2— \ 
m*  +  2m 


which  yields  1.24  x  10“15  for  Wjj  as  in  Table  1.1. 

Now  we  can  apply  Theorem  4.3  to  the  class  of  matrices  ££,. 

Corollary  4.4  Suppose  J  €  ££,  with  tr(J)  >  3a>.  If  u  >  2n,  then  g,(J)  >  for 

i  =  . .  .,n  -  1. 

Proof.  For  J  €  ££,,  A„  <  u>  +  2.  So  we  choose  Tn  =  w  +  2.  Recall  that 
A„  >  An  >  max  {a(fc)}  >  max{a(l),a(n)}  > 


'  I,,  i  \T  a(l)  +  a(n)  r- *  .... 

t  =  — —  [(n-  l)An  -  9  ~2>(*)] 

fc=2 

<  [nAn  -  a(*)]  <  (n(w  +  2)  -  3u»] 

<  — — -  [(n  -  3)u  +  2n]  <  — [(n  —  3)u;  +  u]  =  u>. 

n  —  2  n  —  2 


Thus  by  Theorem  4.3  (ii) 


for  i  =  pfl],...,!*-  1. 


9i  >  2/t"-2  >  2/w"“2. 


As  a  consequence  of  Corollary  4.4  and  Theorem  2.3  (iii), 


if  tr(J )  >  3u>  and 


ffn-i(J)  >  -rzj  >  Pn-i(^(wx)), 


u>  >  2 n. 


(4.11) 


Hence  under  the  condition  (4.11),  the  minimizer  of  (4.8)  must  have  trace  less  than  3u. 
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4.3  Unsymmetric  Diagonal 

From  Section  3.2,  Vgn_i(  J(a))  has  only  three  types  of  sign  pattern 
I.  +0, +0); 

n.  (+0,  +,+0, — ); 

HI.  +0,  +,...,+,  +0,  - 

Here  “+0”  means  the  entry  can  be  positive  or  zero. 

Since  Vyn_x  never  vanishes,  the  global  minimum  point  a  of  (4.8)  must  be  a  bound¬ 
ary  point  of  [0,w]n  which  satisfies  the  Kuhn-Tucker  condition.  Since  all  the  possible  sign 
patterns  of  gradients  are  known,  the  corresponding  minimizers  can  be  only 

I.  (w,w, . . .  ,w,  *,0, . .  .,0,  *); 

n.  (♦,(), ...,0,*,w,u,...,w); 

III.  (w, . . .  ,w,  *,  0, . . . ,  0,  *,u>, . . .  ,u>). 

Here  “  *  ”  means  the  entry  can  be  anything  in  [0,w]. 

To  be  a  minimizer,  it  is  impossible  to  have  too  many  u/s  in  the  diagonal.  Courtesy 
of  the  result  in  the  end  of  last  section,  at  most  two  u's  are  allowed  in  the  diagonal  if  ui  >  2n. 
The  possible  minimizers  are  thus  reduced  to  five  cases 

i.  a  =  (x,0,...,0,y,w,u>)  with  Vyn_,  =  (+0, +,...,+, +0, 

ii.  as(i,0,0 . 0 ,y,w)  with  Vyn_j  =  (+0,  +  ,  + . +,+0,-); 

iii.  a  =  (w,x,0,...,0,y,u>)  with  Vyn_1  =  (-,+  0,+ . +,+0,-); 

iv.  a  =  (w,w,y,0,...,0,x)  with  Vy„_i  =  +0, +,..., -f, +0); 

v.  a  =  (w,y,0,...,0,0,x)  with  Vyn_,  =  (-,  +0,  + . +,+,+0); 

We  want  to  show  that  none  of  the  unbalanced  diagonal  a  can  be  the  minimizer. 
The  following  lemma  is  crucial. 

Lemma  4.5  Assume  sn(l)sn_!(l)  >  0,  A„_x  >  2, 


a(fc)  <  a(n  +  1  -  fc)  for  k  =  1,2, . . j. 
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with  at  least  one  strict  inequality,  and 

a(k )  =  0  for  k  =  j  +  1,  j  +  2, . . . ,  n  —  j, 

for  some  integer  j  <  Then 

Sn-l(j)  3n-l(n  +  1  ~  j) 

*n(j)  sn{n+l  -j) 

Proof.  By  assumption,  we  have  at  least  one  strict  inequality  in  (3.28).  Without  loss  of 
generality,  choose  both  sn(l)  and  sn-i(l)  positive,  then 

sn(k)<sn(n  +  l-k)  forfc  =  l,2,...,[|]  (4.12) 

by  Theorem  3.8,  and 

sw-i(”  3  *)  >  0  for  odd  n,  and  sn_i(^)  +  3n-i(^  +  1)  >  0  for  even  n 
by  Theorem  3.9.  Hence  according  to  Theorem  3.10 

3n_i(;)  +  sn_i(n  +  l-;)>0.  (4.13) 

Notice  that  sn_i(j)  must  be  positive,  otherwise 

s„_i(l)  >  0,  s„_i(;')<0  and  Sn-^n  +  1  -  ;)  >  0, 


by  (4.13),  which  violates  that  sn_i  has  only  one  sign  change.  Therefore,  using  (4.12)  and 
(4.13), 

fn-ljj)  Jn-l(j)  >  Jn-l(n+  1  -  j) 

Sn(j)  sn(n  +  l-j)~  s„(n  +  1  —  j) 


In  all  the  cases  above,  A„  >  max  {a(&)}  >  u.  Suppose  w  >  3  and  n  >  3.  If 

t= 1,2 . n 


An_i  <  w  -  1,  then 


An  —  An_;  >  1  >  _2  >  Sn-1  (*^(wl ))i 


which  is  clearly  not  competitive.  Hence  in  the  sequel  we  assume  A„_i  >  u>  -  1  >  2  Also 
we  take  s„(l),  s„-i(l)  >  0. 

Obviously  case  (i)  satisfies  Lemma  4.5.  Assume  the  (n  -  2)-th  entry  of  V^„_1  is 


sn_i(n  -  2) 
»n(n  -  2) 


zero,  i.e. 
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Then  by  Lemma  4.5 

1  (3)  Jn-i  (n  —  2)  _ 

*„(3)  s„(n  —  2) 

which  is  impossible.  Therefore  the  (n  -  2)-th  entry  of  ^gn-\  must  be  positive  and  y  must 
vanish.  Case  (i)  thus  reduces  to 

a  =  (*,(),. ..,0,0, w,w)  with  V^n_!  =  (+0,+, ...,+, 

Notice  that 

sn- i(n-  1)  _ 

By  Lemma  4.5  again,  we  have 

Sn-l(2)  3n-l(n  -  1) 

sn(  2)  >  s„(n  —  1)  >l' 

a  contradiction.  We  conclude  that  there  is  no  diagonal  a  in  case  (i)  such  that  'Vgn_i(J( a)) 
satisfies  the  desired  sign  pattern. 

To  exclude  wj  from  case  (ii),  either  x  <  u)  or  y  >  0.  Assume  the  (n  —  l)-th  entry 
of  is  zero,  i.e. 

sn-i(n-  1) 

*n(»  -  1) 

Then  by  Lemma  4.5 

1  >  (3)  ^n-l(n  ~  1)  _  . 

sn{  2)  sn(n-  1)  ~  ’ 

which  is  impossible.  Thus  the  (n-2)-th  entry  of  Vyn_!  must  be  positive  and  y  must  vanish. 
Case  (ii)  then  reduces  to 


a  =  (*,0 . 0,w)  with  V$„_i  =  (+0, 


Notice  that 

Sn-\(n) 

*»(«) 

Again  Lemma  4.5  forces  a  contradiction 


,  ^  «n-l(l)  ^  »n-l(n)  ^  , 

-  s„;i)  >  sn(n)  >l’ 

In  summary  there  is  no  a  in  case  (ii),  except  wi,  with  the  desired  sign  pattern  of  V <?„_]. 

Similarly  we  assume  *  <  y  in  case  (iii).  Then  y  >  0  and  the  (n  -  2)-th  entry  of 
j  must  vanish.  In  other  words 


Jn-l(n) 

*n(") 
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By  Lemma  4.5  again  we  reaches  a  contradiction 

.  >  *w— i(l)  Jn-i(n)  _ 

"  sn( 1)  *  s„(n)  A’ 

The  counter  part  i  >  y  can  be  eliminated  similarly. 

Notice  that  (iv)  and  (v)  are  symmetric  to  (i)  and  (ii),  hence  they  can  be  dealt  in 
the  same  fashion  and  the  only  possibility  is  wi. 

Therefore  only  the  symmetric  diagonal  a  =  (u>,x,0,...,0,x,u;)  in  case  (iii)  sur¬ 
vived  our  test.  Since  we  have  ruled  out  the  situation  when  trace  >  2u  by  (4.11),  we  only 
need  to  consider  x  <  ^ .  This  will  be  discussed  in  the  next  section. 


4.4  Isolated  Minimum 


In  this  section  we  consider  (4.8)  with  a  near  wj.  Assume  n>4,  O^x^^  and 
diagonal  vector 

u(x)  =  (w,x,0,...,0,x,u>). 

We  want  to  show  Vyn_!(J(u))  has  sign  pattern 


(— i  +>  +»•••»+»+»  — )  (4.14) 

if  u  is  sufficiently  large.  Hence  J( wj)  satisfies  the  Kuhn-Tucker  condition  on  the  boundary 
of  [0,  u>]n.  Restricted  to  the  class  of  admissible  matrices  ££,,  y„_i  attains  a  relative  minimum 
at  J(wj).  Moreover  J(u(x))  can  not  be  the  minimizer  for  any  x  €  (0,yj. 

Lemma  4.6  Assume  n  >  4  and  0  <  x  <  j,  then 

i.  u  <  An_i(  J(u(x)))  <  An(7(u(x)))  <  u  + 

**■  y„_i(7(u(x)))  <  5. 

Proof.  The  Weyl’s  Monotonicity  Theorem  [5,  Corollary  4.3.3]  says  that  all  the  eigenval¬ 
ues  of  a  symmetric  matrix  increase  if  a  positive  semidefinite  matrix  is  added  to  it.  Therefore, 
by  Theorem  2.3  (ii), 

A„_i(J(u(x)))  >  An_i(J(wx))  >  v  +  u_\u}X.n  ~  -~2  >  u- 
Consider 


u'(x)  =  (w  -  x,0,.  ..,0,u;  -  x)  and  e  =  (l,l . 1). 


60 


Then  by  Theorem  2.3  (i) 

11  2  4 

A„(/(u'(x)))  <  LJ  -  X  +  -  +  7 - r—r  <  U  —  I  ■+  -  <  U!  -  X  +  - . 

Ul  —  X  {u  —  x)n~i  u  -  X  u> 

By  Monotonicity  Theorem  and  Theorem  1.4  (i) 

An(J(wi))  <  A„(J(u'(x)  +  xe))  =  A„(J(u'(x)))  +  x  <  u>  +  -. 

U) 

Obviously  (ii)  follows  from  (i).  □ 

Theorem  4.7  Assume  n  >  4  and  u  >  v^n  —  14  +  2.  Then 

i.  wj  locally  minimizes  gn-i(J( a))  over  all  a  6  [0,u]n; 

ii.  for  all  x  6  (0,^],  (w,x,...,x,u)  is  not  the  minimum  point  o/jn_1( J(a)). 

Proof.  Since  J(u)  is  centrosymmetric,  S{  is  either  symmetric  or  skew  symmetric  by 
Corollary  1.10  (i).  Hence  VgFl_1( J(u))  is  symmetric  and  it  suffices  to  prove  that  the  first 
half  of 


Vg„.x{J(u))  =  (s„(l)2  -  «„_i(1)2,3„(2)2  -  «„_i(2)2 . s„(n)2  -  s„_i(n)2). 

has  the  desired  sign  pattern  (4.14).  Recall  that 


s.(l)2  = 


n  i  a*  -  a,  i 

km  1 
*#« 


and  s,(2)2  =  -4 - - — — 


n  |Afc  -  A, | 

km  1 
k*i 


for  i  =  1, 2, . . .,  n  by  Corollary  1.10  (iii). 

Since  A„  >  A„_i  >  Afc  for  k  ~  1, 2, . . . ,  n  -  2, 


‘"(D2  = 


1 


n  \\,  -  n  ia„_,  -  A* j 


=^n-l(l)2. 


Hence  the  first  component  of  (4.15)  is  negative. 

For  the  second  entries  of  (4.15),  we  need  s„(2)2  >  s„_ i(2)2,  so 

(An  —  u;)2  >  (Aw— r  —  u>)2 


"n1  iAn  -  Afci  fi  ia„-,  -Afc|’ 


(4.15) 
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/  —  LJ  >2  ITT  An  Afc 

so  (1  + -»==!_)»  >ff(l  +  ■  »-'  ). 

A„-1  -  w  ^Tj  An—!  -  Afc 


(4.16) 


By  Lemma  4.6  (i) 


so  the  left  hand  side  of  (4.16) 


A„_i  -  w  <  An  -  u  <  — , 

LJ 


/,  ,  9n-l  ^2  ».  /,  ,  9n-lu^2  _  ,  ,  w  ,  w  .2 

(l+W^)  >(1  +  ~5_)  -  1+2,"-1  +  l6S"->- 

Gershgorin’s  Disk  Theorem  gives  the  location  of  A,,  which  is  contained  in 


(4.17) 


[~2, 2]|J[x  —  2,x  +  2]  JJ  [w  —  l,u>  +  1]. 

Since  x  <  y ,  the  interval  [a;  -  1, lj  +  1]  is  well  separated  from  the  others  if  lj  >  6.  In  this 


and  by  Lemma  4.6  (i) 


Aj  <  •  •  •  <  An_2  ^3J  +  2<  —  +  2 


An— i  -A*>u>-(—  +2)  =  —  -2 


for  k  =  1, 2, . . n  -  2.  The  right  hand  side  of  (4.16) 


<  nd+^F1^)  =(i+^r-j 


<  n,(1+^ 


w  —  4 


.  ,  2ffn— 1  \n— 2  r2(”-2)  i 

<  (exp - -)  =exp  - ~5n-l] 

lj  —  4  lj  —  4 

1  |  2<"-2),  ,  e\2(n-2W 


(4.18) 


The  last  equality  uses  Taylor  series  expansion  with  the  remainder  term  where 

«?€(0 

To  have  inequality  (4.16),  it  suffices  to  compare  the  quadratic  polynomials  of  gn_j 
in  (4.17)  and  (4.18).  Assume 

n  —  2  1 

u(u>  -  4)  <  9 

The  coefficients  of  g„-i  in  (4.17)  and  (4.18)  satisfy 

2(n  -  2)  2 u>  lj 

w-4  9  2 


(4.19) 
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Since 


2gn-i(n  -  2)  8(n  -  2)  8 

V  u  -  4  u{u  -  4)  9 


by  Lemma  4.6  (ii)  and  (4.10),  the  coefficients  of  j  in  (4.17)  and  (4.18)  obey 

e'».2(n-2).2  e8/9.2u>,,  2e8/9  2  n  ,  u2 

2  w  -  4  1  2  9  81  16 


-<T> 


We  conclude  that  (4.16),  or  equivalently  s„(2)2-sn_i(2)2  >  0,  holds  under  (4.19). 
Rewrite  (4.19) 


u>2  -  4u>  >  9(n  -  2) 


(u  -  2)2  >  9n  -  14 
u  >  \/9n  -14  +  2. 


(4.20) 


Notice  that  when  n  >  4 


>  V9  •  4  -  14  +  2  =  V22  +  2  «  6.69  >  6. 


Recall  that  the  ratio  of  s„-i(k)  and  sn(k)  satisfies 

3n— i(l)  3n— 1(2)  •Sn-i([a^~]) 

■Sn(l)  sn(2)  -Sn^2^]) 

by  Theorem  3.2.  Thus 

(4.20)  =>  5n(2)2  >  sn-i(2)2 

.  gn-l  (2)  (3) 

*n(2)  S„(3) 

=>  sn(k)2  >  sn.i{k)2  for  2,3,.  . 

Hence  (4.15)  has  the  desired  sign  pattern  by  symmetry.  □ 


4.5  Global  Minimum 

We  have  solved,  as  promised,  the  minimization  problem 

mln{g(J(&))\  a  G  [0,u>]"}  (4.21) 

for  w  large  enough:  gn-i(J( Wj))  is  the  minimum.  We  now  outline  the  whole  argument. 
One  key  idea  is  to  use  our  candidate  5„_i(J(wj))  as  an  eliminator.  Since 

5n_,(J(w1))< 
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by  Theorem  2.3  (iii),  the  minimum  in  (4.21)  must  be  smaller  than  Hence  we  can 

throw  away  all  the  J( a)  with  eigenvalue  separations  not  less  than  •  ■ 

For  the  class  of  matrices  ££,  of  interest,  their  spectra  are  contained  in  [-2,iw  +  2] 
by  Gershgorin’s  Disk  Theorem.  Thus  the  spread  d  of  each  J  G  ££,  is  less  than  w  +  4.  By 
Theorem  4.2 

9'^  -  (w  +  4)"“2  ~  un~2 

for  t  =  2, 3, . .  .,n  —  2  if 


n  >  4  and  w  >  2.8854(n  -  2).  (4.22) 

Hence  under  the  assumption  (4.22),  the  minimum  of  (4.21)  comes  from  51  or  gn-\-  The 
constant  2.8854  is  not  the  best  possible,  however,  some  condition  on  u/n  is  necessary. 

Duality,  introduced  in  Section  1.3,  shows  that  there  cannot  be  a  unique  minimizer 

since 

0i(J(a))*0„_i(J(a’)) 

by  Corollary  1.5  (iii),  where 

a*  =  we  -  a  =  (w  -  a(l),u>  -  a(2), . . .  ,w  -  a(n )) 

is  the  dual  of  a  with  respect  to  u.  Note  that  a*  G  [0,u;]"  if  and  only  if  a  G  [0,cw]n.  Therefore 
under  the  condition  of  (4.22),  (4.21)  is  reduced  to 


min  {  g„-i(J{*))\  a  G  [0, w]r>. 


(4.23) 


Now  we  have  a  smooth  functional  to  minimize. 

Another  key  idea  is  to  use  the  trace  to  eliminate  rivals  to  J(wi).  As  a  consequence 
of  Corollary  4.4,  for  J  G  ££,  with  tr(*  )  >  3 w, 


9n- l{J)  >  ^2  >  Wi)), 


provided  that 


u  >  2 n. 


(4.24) 


Again  some  condition  on  ration  w/n  is  necessary  and  (4.24)  probably  can  be  improved. 
Anyhow  the  minimizer  of  (4.23)  must  have  trace  less  than  if  we  assume  (4.24). 

The  global  minimum  of  (4.23)  must  be  a  relative  minimum  subject  to  the  constraint 
and  obeys  the  Kuhn-Tucker  condition:  the  diagonal  a  needs  to  match  the  sign  pattern  of 
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V<7„_i(  J(a)).  Since  all  the  possible  sign  patterns  of  gradients  are  known  from  Section  3.2, 
we  can  obtain  all  the  possible  minimizers. 

Under  (4.24),  the  possible  minimizers  finally  reduced  to  five  cases 


i. 

a  =  (x,0,...,0 

,y,w,u>) 

with 

^9n- 1  =  (+0)  •  -5 

+>  +0,  — ); 

ii. 

a  =  (x,  0, 0, . . . 

,0  ,y,u>) 

with 

^9n- 1  —  (  +  0,  +,  +,  . 

.  .,+,+0,— ); 

iii. 

a  =  (w,x,0, . . . 

»0,y,w) 

with 

^ 9n-l  -  (  — ,  +0,  +  ,  . 

. +0,  -); 

iv. 

a  =  (w,w,j/,0, 

...,0,x) 

with 

Vffn-i  =  (  —  t  — ,  +0,  +0); 

v. 

a  =  (w,  2/,  0,  — 

,  0, 0,  x) 

with 

^ 9n-\  ~  (  — ,  +0,  +,  . 

•  •  i  +,  +,  +0). 

All  the  cases  with  unsymmmetric  diagonal  a  are  impossible.  This  is  courtesy  of 
Lemma  4.5.  Therefore  only  the  symmetric  diagonal  u(x)  =  (u>,x,0, . .  .  ,0,x,lj)  in  case  (iii) 
survives.  By  the  requirement  of  trace  >  3u>,  we  only  need  to  consider  x  <  y . 

In  Theorem  4.7  we  show  Vg„-i(J(u))  has  sign  pattern  (  — ,  if 

n>4  and  u  >  \/9 n  —  14  +  2.  (4.25) 

Hence  among  all  u(x)’s,  only  wj  satisfies  the  Kuhn-Tucker  condition  on  the  boundary  of 
[0,w]n,  and  is  the  unique  minimizer  for  (4.8). 

Under  the  assumptions  of  u>  >  3,  (4.24)  and  (4.25),  J(wj)  uniquely  minimizes 
(4.23).  Notice  that 

2 n  >  V9U  -14  +  2  <=>  (2n  -  2)2  >  9n  -  4 

<=>  4n2  —  17n  +  18  >  0  <=>  (4 n  -  9)(ti  -  2)  >  0 

is  true  if  n  >  4.  Since  u  >  2n  >  8  >  3,  only  the  conditions  n  >  4  and  (4.24)  are  needed. 

For  (4.21),  we  have  two  minimizers  J( wt)  and  J(wj*)  if  (4.22)  and  (4.24)  hold. 
Compare  the  right  hand  sides  of  (4.22)  and  (4.24) 

2.8854(n  -  2)  >  2n  <=>  .8854n  >  5.7708  4=»  n  >  6.518. 


10.410 
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Figure  4.1:  Graph  of  tsr(n)  in  [6,50]. 

Theorem  4.8  i.  Assume  n  >  4  and  u  >  2 n,  then  wj  uniquely  minimizes  gn_i(  J(a)) 
ouer  a//  a  €  [0,w]n. 

it.  Assume  n  >  4  and u;  >  2n,  then  wx*  uniquely  minimizes  gi(J(a))  over  all  a  G  [0,o,-]n. 

Hi.  Assume  n  >  7  and  u;  >  2.8854(n  -  2),  then  and  wi‘  are  the  only  minimizers  of 
g(J{  a))  over  all  a  E  [0,w]n.  For  n  =  4,5,6,  we  need  u  >  2n. 

This  result  shows  that  /(wx)  and  J(wi’)  are  the  extremal  matrices  of  the  class 
C™,  for  u  large  enough.  More  precisely,  Theorem  4.8  requires 

u  >  2n  or  u  >  2.8854(n  —  2).  (4.26) 

However  as  mentioned,  neither  inequality  is  sharp.  For  each  n,  there  is  a  value  u  =  — (n) 
such  that 

5n_i(J(w!))  =  gn-i(J(w2)), 

for  which  J(wi)  and  J( wj)  are  both  minimizers.  Theorem  4.8  holds  for  u  >  tn(n),  and 
this  inequality  is  then  the  best  possible. 

We  have  not  been  able  to  determine  the  functional  form  of  »(n),  but  by  using 
Mathematica  with  up  to  70  decimal  digit  precision,  we  computed  »(n)  for  n  =  6,7, . .  .,50 
to  two  digits  of  accuracy.  The  graph  is  displayed  in  Figure  4.1  and  appears  to  grow  a  little 
less  than  linearly.  In  fact, 


0.21n  +  0.87  >  to(n)  >  0.21n  -  0.1 


for  6  <  n  <  50. 
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So  the  conditions  (4.26)  are  of  the  right  order  of  magnitute. 

From  Figure  4.1,  we  expect 

®(n)  — ►  oc  as  n  — »  oo. 

When  the  size  of  diagonal  spread  u>  is  fixed,  Theorem  4.8  holds  for  only  a  finite  number  of 
n’s. 


67 


Bibliography 

[1]  M.  S.  Ashbaugh  and  R.  D.  Benguria.  Some  eigenvalue  inequalities  for  a  class  of  jacobi 
matrices,  submitted  to  Linear  Algebra  and  Its  Applications. 

[2]  M.  S.  Ashbaugh  and  R.  D.  Benguria.  Optimal  lower  bound  for  the  gap  between  the  first 
two  eigenvalues  of  one-dimensional  schrodinger  operators  with  symmetric  single-well 
potentials.  Proc.  Amer.  Math.  5oc.,  105:  419-424,  1989. 

[31  A.  Cantoni  and  P.  Butler.  Eigenvalues  and  eigenvectors  of  symmetric  centrosymmetric 
matrices.  Linear  Algebra  and  Its  Applications,  13:  275-288,  1976. 

[4]  T.  Ericsson.  Private  communication,  1985. 

[5]  R.  A.  Horn  and  C.  R.  Johnson.  Matrix  Analysis.  Cambridge  University  Press,  New 
York,  1985. 

[6]  W.  Kirsch  and  B.  Simon.  Universal  lower  bounds  on  eigenvalue  splittings  for  one 
dimensional  schrodinger  operators.  Comm.  Math.  Phys.,  97:  453-460,  1985. 

[7]  W.  Kirsch  and  B.  Simon.  Comparison  theorems  for  the  gap  of  schrodinger  operators. 
J.  Fund.  Anal.,  75:  396-410,  1987. 

[8]  C.  C.  Paige.  The  computation  of  eigenvalues  and  eigenvectors  of  very  large  sparse 
matrices.  PhD  thesis,  University  of  London,  1971. 

[9]  B.  N.  Parlett.  The  Symmetric  Eigenvalue  Problem.  Prentice-Hall,  Englewood,  N.J., 
1980. 

[10]  B.  N.  Parlett.  Given  eigenvector  recurrence  revisited.  Invited  talk  at  the  Proceedings 
of  the  1st  International  SLAM  meeting,  ICIAM,  Minisymposium  in  honor  of  J.  H. 
Wilkinson,  Paris,  July  1987. 


68 


[11]  I.  M.  Singer,  B.  Wang,  S.-T.  Yau,  and  S.  S.-T.  Yau.  An  estimate  of  the  gap  of  the  first 
two  eigenvalues  in  the  schrodinger  operator.  Ann.  Scuola  Norm.  Sup.  Pisa  Ser.  4,  12: 
319-333, 1985. 

[12]  J.-C.  Sun.  On  the  minimum  eigenvalue  separation  for  matrices.  Mathematica  Numerica 
Sinica,  7(3),  1985. 

[13]  R.  C.  Thompson  and  P.  McEnteggert.  Principal  submatrices  II,  the  upper  and  lower 
quadratic  inequalities.  Linear  Algebra  and  Its  Applications ,  1:  211-243,  1968. 

[14]  B.-Y.  Wang.  The  bounds  of  minimum  eigenvalue  separation  of  some  matrices.  Mathe¬ 
matica  Numerica  Sinica,  5(2):  176-186,  1983. 

[15]  J.  H.  Wilkinson.  The  Algebraic  Eigenvalue  Problem.  Oxford  University,  New  York, 
1965. 

[16]  Q.  Yu  and  J.-Q.  Zhong.  Lower  bounds  on  the  gap  between  the  first  and  second  eigen¬ 
values  of  schrodinger  operator.  Trans.  Amer.  Math.  Soc.,  294:  341-349,  1986. 


Legal  Notice 


This  report  was  prepared  as  an  account  of  work  sponsored  by  the 
Center  for  Pure  and  Applied  Mathematics.  Neither  the  Center  nor 
the  Department  of  Mathematics  makes  any  warranty,  expressed  or 
implied,  or  assumes  any  legal  liability  or  responsibility  for 
the  accuracy,  completeness  or  usefulness  of  any  information  or 
process  disclosed. 


